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GENES FROM PROPIONIBACTERIU M FREUD ENREICHII ENCODING ENZY MES INVOLVED IN VITAMIN 
B12 BIOSYNTHESIS ' * 

Field of Invention 

10 

This invention relates to genes and thie proteins (such as enzymes) involved in 
the biosynthetic pathway of vitamin Bi 2 . In particular, the invention provides four 
novel genes (and their corresponding encoded enzymes), all derived from 
Propionibacteria, in particular Propionibacterium freudenreichii. These enzymes 
15 are either synthases or transferases, and can be used in the manufacture of vitamin 
B12. 

Introduction 

20 Vitamin Bi 2 is an important vitamin for humans and animals. It is an essential 

vitamin, and is obtained from foodstuffs in the human and animal diet. Vitamin Bi 2 
is naturally found in animal foods including fish, milk and milk products, eggs, meat 
and poultry. Certain foodstuffs, for example breakfast cereals, are fortified in 
vitamin B i2 , and provide a particularly valuable source of the vitamin for vegetarians. 

25 Vitamin Bi 2 is used to treat pernicious anaemia and peripheral neuritis, and is also 
used as a supplement to animal feeds. 

The term vitamin Bi 2 is used to describe compounds of the cobalt corrinoid 
family, in particular those of the cobalamin group. The most referred to compound 
of this group is cyanocobalamin and as such the term vitamin Bi 2 is sometimes used 

30 to refer to cyanocobalamin. In this specification the term vitamin B 12 should be 
attributed its broad meaning so as to include all the cobalt corrinoids of the 
cobalamin group, which include in particular cyanocobalamin, hydroxocobalamin, 
methylcobalamin and S'-desoxyadenosylcobalamin characterised by a cyano, 
hydroxyl, methyl or 5 '-desoxyadenosyl radical respectively. 
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Vitamin B12 is produced industrially by microbial fermentation, especially using 
Pseudomonas denitrificans. However, current production levels of vitamin Bi 2 do not 
always enable cost-effective production of vitamin Bi 2 . To increase vitamin B !2 
productivity, efforts need to be made to improve the fermentation process. 
5 The biosynthetic pathway of vitamin B n in Pseudomonas denitrificans has been 

well characterized 24 . This has elucidated most of the pathway 25 . A total of 22 
enzymes were purified, and 22 cob genes have been identified. The role of some of 
these genes however is still unknown. It is thought that a closely related, but 
somewhat different pathway, operates in Propionibacterium shermanii. 

10 In addition, workers have studied the cobalamine by synthetic pathway in 

Salmonella typhimurium. The S. typhimurium cob operon was isolated and cloned 
into E. colU and that approach gave the new host the ability to make cobalamins de 
novo, an ability which did not previously exist. In terms of patent publications, 
Blanche of Rhone Poulenc Rorer is referred to concerning biosynthesis methods 

1 5 enabling the preparation of cobalamins . 

A total of 14 genes, encoding enzymes responsible for 17 steps of the anaerobic 
BT2 pathway in Propionibacterium freudenreichii, have been proposed 41 . However, 
this document does not give any sequences, and expresses only two gene products, 
allegedly causing methylation. They are expressed in E. colu and there is no 

20 disclosed use of these genes to actually produce vitamin Bi 2 . 

While vitamin B i2 has been produced industrially using Propionibacterium 
species, the yields and production levels are not entirely satisfactory, and there is 
room for improvement. Therefore, research has been undertaken to elucidate the. 
biosynthetic pathway in Propionibacterium freudenreichii, and as a result four 

25 different genes and enzymes, of the present invention, have been identified. This 
allows improvements in vitamin B i2 yield on an industrial scale. 

Summary of the Invention 

30 Novel enzymes are now provided which are involved in the biosynthesis of 

vitamin Bi 2 . 
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At its broadest, the invention in a first aspect relates to a synthase or transferase from 
a Gram positive bacteria from the Order Actinomycetales, for example famil y 
Propionibacteriaceae, such as of the genus Propionibacterium, such as the species 
Propionibacterium freudenreichii. These enzymes are (e.g. amide) synthases or (e.g. 
5 phospho or nucleotidyl) transferases. Preferably they have the activity EC 6.3.1.-, 
2.7.7-, 2.7.8- or 2.5.1. 17. 

More specifically, the present invention provides, in a first aspect, an (isolated 
and/or purified) synthase or transferase polypeptide comprising: 

(i) the amino acid sequence of SEQ ID No: 2, 4, 6 or 8; or 
10 (ii) a variant of (i) which is a synthase or transferase; or 

(iii) a fragment of (i) or (ii) which is a synthase or transferase. 

According to a second aspect of the invention there is provided a polynucleotide 
which comprises: 

(a) the nucleic acid sequence of SEQ ID No. 1, 3, 5 or 7, or a sequence 
. 15 encoding a polypeptide of the invention; 

(b) a sequence which is complementary to, or which hybridises to, any 
sequence as defined in (a); 

(c) a fragment of any sequence in (a) or (b); 

(d) a sequence having at least 60, 65 or 70% identity to any sequence as 
20 defined in (a), (b) or (c); or 

(e) a sequence that is degenerate as a result of the genetic code to any of the 
sequences as defined in (a) to (d). 

The invention also provides: 

an (e.g. expression) vector (third aspect) which comprises a 
25 polynucleotide of the invention and which may be capable of 

expressing a polypeptide of the invention; 

a host (fourth aspect), such as cell line or strain comprising a vector of 

the invention; 

a method of producing a polypeptide of the invention which method 
30 comprises maintaining a cell line or strain of the invention under 

conditions suitable for obtaining expression of the polypeptide and, if 
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necessary, isolating the polypeptide; and 
a method of producing vitamin B 12 or a precursor thereof (fifth 
aspect), the method comprising contacting a substrate with a 
polypeptide or host* cell of the invention. 

5 . 

Brief Description of the Sequences 

SEQ ID No. 1 is a DNA sequence of a first enzyme, an amide synthase, of the 
invention from Propionibacterium freudenreichii; 
10 SEQ ID No. 2 is the amino acid sequence of the first enzyme (A); 

SEQ ID No. 3 is a DNA sequence of a second enzyme, a (phospho and/or 

nucleotidyl) transferase from the same organism; 
SEQ 3D No. 4 is the amino acid sequence of the second enzyme (B); 
SEQ ID No. 5 is a DNA sequence of a third enzyme, a transferase . also from the same 
15 organism; 

SEQ ID No. 6 is the amino acid sequence of the third enzyme (C); 

SEQ ID No. 7 is a DNA sequence of a fourth enzyme, a (nucleotidyl) transferase also 

from the same organism; 

SEQ ID No. 8 is the amino acid sequence of the fourth enzyme (D); and 
20 SEQ ID Nos. 9 to 17 are primers. 

Detailed Description of the Invention 

A. Polynucleotides 

25 The present invention provides a (e.g. isolated and/or purified) polynucleotide 

encoding polypeptides of the invention. The present invention thus provides a 
polynucleotide, preferably encoding a synthase or transferase whose amino acid 
sequence is set out in SEQ ID No. 2, 4, 6 and/or 8. The present invention further 
provides a polynucleotide encoding a polypeptide having substantial amino acid 

30 sequence homology to the amino acid sequence set out in SEQ ID No. 2, 4, 6 and/or 
8. Also included is a polynucleotide selected from: 
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(a) a polynucleotide comprising the nucleotide sequence set out in SEQ ID No. 
1, 3, 5 and/or 7, or the complement thereof; 

(b) a polynucleotide comprising a nucleotide sequence capable of (e.g. 
selectively) hybridising to a nucleotide sequence set out in SEQ ID No. 1, 

5 3, 5 or 7, or a fragment thereof; 

(c) a polynucleotide comprising a nucleotide sequence capable of (e.g. 
selectively) hybridising to the complement of the nucleotide sequence set 
out in SEQ ID No. 1, 3, 5 or 7, or a fragment thereof; and/or 

(d) a polynucleotide comprising a polynucleotide sequence that is degenerate 
10 as a result of the genetic code to a polynucleotide defined in (a), (b) or (c). 

Polynucleotides may comprise 2, 3 or more sequences of the invention; for example 
SEQ ID 

Nos. 3 and 5 or 1, 3 and 5 (or variants thereof as defined in any of (b), (c) and (d) 
above). 

15 A polynucleotide of the invention may also include a polynucleotide which: 

(a) encodes a polypeptide having synthase or transferase activity, which 
polynucleotide is: 

(1) the coding sequence of SEQ ID No. 1, 3, 5 or 7; 

(2) a sequence which hybridises selectively to the complement of 
20 sequence defined in (1); or 

(3) a sequence that is degenerate as a result of the genetic code with 
respect to a sequence defined in (1) or (2); or 

(b) is a sequence complementary to a polynucleotide defined in (a). 

25 Hybridisable sequences 

The term "capable of hybridizing" means that the target polynucleotide of the 
invention can hybridize to a nucleic acid used as a probe (for example the nucleotide 
sequence set out in SEQ. ID No. 1, 3, 5 or 7, or a fragment thereof or the complement 
30 thereof) at a level significantly above background. The invention also includes 

nucleotide sequences that encode for synthase or transferase or variants thereof as 
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well as nucleotide sequences that are complementary thereto. The nucleotide 
sequence may be RNA or DNA and thus includes genomic DNA, synthetic DNA or 
cDNA). Preferably the nucleotide sequence is a DNA sequence if appropriate and, a 
cDNA sequence. Typically a polynucleotide of the invention comprises a contiguous 
5 sequence of nucleotides which is capable of hybridizing under selective conditions to 
the coding sequence or the complement of the coding sequence of SEQ ID No. 1,3,5 
or 7, as appropriate. Such nucleotides can be synthesized according to methods well 
known in the art 1 . 

A polynucleotide of the invention can hybridize to the coding sequence or the 

10 complement of the coding sequence of SEQ ID No.l, 3, 5 or 7 (as appropriate) at a 
level significantly abqve background. Background hybridization may occur, for 
example, because of other cDNAs present in a cDNA library. The signal level (e.g. 
generated by the interaction between a polynucleotide of the invention and the coding 
sequence or complement . of the coding sequence of SEQ ID No. 1, 3, 5 or 7) is 

15 typically at least 10 fold, preferably at least 100 fold, or as intense as interactions, 
between other polynucleotides and the coding sequence of SEQ ID No. 1, 3, 5 or 7. 
The intensity of interaction may be measured, for example, by radiolabelling the 
probe, e.g. with 32 P. Selective hybridization may typically be achieved using 
conditions of low stringency (0.3M sodium chloride and 0.03M sodium citrate at 

20 about 40°C), medium stringency (for example, 0.3M sodium chloride and 0.03M 

sodium citrate at about 50°C) or high stringency (for example, 0.3M sodium chloride 
and 0.03M sodium citrate at about 60°C). Hybridization may be carried out under 
any suitable conditions known in the art 1 and, as a guide, low stringency can be 2 x 
SSC at 55°C, medium stringency can be 0.5 to 1.0 x SSC at 60°C and high stringency 

25 can be 0.1 or 0.2 x SSC at 60°C or higher (e.g. at 68°C), all at 0.5% SDS. 

Modifications 



30 



Polynucleotides of the invention may comprise DNA or RNA. They may be 
single or double stranded. They may also be polynucleotides which include within 
them one or more synthetic or modified nucleotides. A number of different types of 
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modifications to polynucleotides are known in the art. These include 
methylphosphonate and phosphorothioate backbones and/or addition of acridine or 
polylysine chains at the 3' and/or 5' ends of the molecule. For the purposes of the 
present invention, it is to be understood that the polynucleotides described herein 
5 may be modified by any method available in the art. 

It is to be understood that skilled persons may, using routine techniques, make 
nucleotide substitutions that do not affect the polypeptide sequence encoded by the 
polynucleotides of the invention to reflect the codon usage of any particular host 
organism, for example in which the polypeptides of the invention are to be expressed. 
10 The coding sequence of SEQ ID No. 1, 3, 5 or 7 may be modified by nucleotide 

substitutions, for example from or up to 1, 2 or 3 to 10, 25, 50 or 100 substitutions. 
The polynucleotide may alternatively or additionally be modified by one or more 
insertions and/or deletions and/or by an extension at either or both ends. The 
modified polynucleotide generally encodes a polypeptide which has synthase or 
15 transferase activity. Degenerate substitutions may be made and/or substitutions may. 
be made which would result in a conservative amino acid substitution when the 
modified sequence is translated, for example as discussed with reference to 
polypeptides later. 

20 Homologues. 

A nucleotide sequence which is capable of selectively hybridizing to (e.g. the 
complement of) the DNA coding sequence of SEQ ID No. 1, 3, 5 or 7 may have at 
least 50% or 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 98% 

25 or at least 99% sequence identity (or homology) to the coding sequence of SEQ ED . 
No. 1, 3, 5 or 7. This may be over a region of at least 20, preferably at least 30, for 
instance at least 40 or 50 such as at least 60 or 80, more preferably at least 100, 200, 
400, 500 or even 600 contiguous nucleotides or optimally over the fall length of SEQ 
ID No. 1, 3, 5 or 7. For individual sequences the sequence identity maybe: 

30 (a) for SEQ ID No, 1, at least 85% or 90%; 

(b) for SEQ ID No. 3, at least 70%; 
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(c) for SEQ ID No. 5, at least 90% or 95%; and/or 

(d) for SEQ ID No. 7, at least 90%, 95% or 98%. 

Any combination of the above mentioned degrees of homology and minimum 
sized may be used to define polynucleotides of the invention, with the more stringent 
5 combinations (i.e. higher homology over longer lengths) being preferred. Thus for 
example a polynucleotide which is at least 80% or 90% homologous over 25, 
preferably over 30 nucleotides forms one aspect of the invention, as does a 
polynucleotide which is at least 90% homologous over 40 nucleotides. 

Homologues of polynucleotide (or protein) sequences typically have at least 

10 70% homology, preferably at least 80, 90%, 95%, 97% or 99% homology, for 

example over a region of at least 15, 20, 30, 100 more contiguous nucleotides (or 
amino acids). The homology may calculated on the basis of amino acid identity 
(sometimes referred to as "hard homology"). Identity or homology is usually 
calculated on the basis of the entire sequence unless otherwise specified. 

15 For example the UWGCG Package provides the BESTFIT program which can 

be used to calculate homology (for example used on its default settings 5 ). The 
PILEUP and BLAST algorithms can be used to calculate homology or line up 
sequences (such as identifying equivalent or corresponding sequences, for example 
on their default settings 6,7 ). 

20 Software for performing BLAST analyses is publicly available through the 

National Center for Biotechnology Information (http://www.ncbi.iilm.nih.gov/). This 
algorithm involves first identifying high scoring sequence pair (HSPs) by identifying 
short words of length W in the query sequence that either match or satisfy some 
positive-valued threshold score T when aligned with a word of the same length in a 

25 database sequence. T is referred to as the neighbourhood word score threshold 6,7 . 
These initial neighbourhood word hits act as seeds for initiating searches to find 
HSPs containing them. The word hits are extended in both directions along each 
sequence for as far as the cumulative alignment score can be increased. Extensions . 
for the word hits in each direction are halted when: the cumulative alignment score 

30 falls off by the quantity X from its maximum achieved value; the cumulative score 
goes to zero or below, due to the accumulation of one or more negative-scoring 
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residue alignments; or the end of either sequence is reached. The BLAST algorithm 
parameters W, T and X detemiine the sensitivity and speed of the alignment. The 
BLAST program uses as defaults a word length (W) of 11, the BLOSUM62 scoring 
matrix 8 alignments (B) of 50, expectation (E) of 10, M=5, N=4, and a comparison of 
5 both strands. 

The BLAST algorithm performs a statistical analysis of the similarity between 
two sequences 9 . One measure of similarity provided by the BLAST algorithm is the 
smallest sum probability (P(N)), which provides an indication of the probability by 
which a match between two nucleotide or amino acid sequences would occur by 

10 chance. For example, a sequence is considered similar to another sequence if the 
smallest sum probability in comparison of the first sequence to the second sequence 
is less than about 1, preferably less than about 0.1, more preferably less than about 
0.01, and most preferably less than about 0.001 . 

Fragments, homologues and other variants maybe at least 500 or 550 

15 nucleotides in length (e.g. for SEQ. ID. No. 7) and may encode at least (e.g. the first) 
170, 180 or 200 amino acids of the corresponding protein. 

Primers and Probes 

20 Polynucleotides of the invention include and may be used as a primer, e.g. a 

PCR primer, a primer for an alternative amplification reaction, a probe, or the 
polynucleotides may be cloned into vectors. Such primers, probes and other 
fragments will be at least or up to 20, for example at least 25, 30 or 40 nucleotides in 
length. They will typically be up to 40, 50, 60, 70, 100, 150, 200 or 300 nucleotides 

25 in length, or even up to the same number or a few nucleotides (such as 5 or 10 
nucleotides) short of the coding sequence of SEQ ID No. 1, 3, 5 or 7. 

In general, primers will be produced by synthetic means, involving a step-wise 
manufacture of the desired nucleic acid sequence one nucleotide at a time. 
Techniques for accomplishing this using automated techniques are readily available 

30 in the art. 

Longer polynucleotides will generally be produced using recombinant means, 
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for example using PCR (polymerase chain reaction) cloning techniques. This will 
involve making a pair of primers (e.g. of about 15-30 nucleotides) to a region of the 
synthase or transferase which it is desired to clone, bringing the primers into contact 
with mRNA or cDNA obtained from a target (e.g yeast, bacterial, plant, prokaryotic 
5 or fungal) cell, preferably of an a bacterial, e.g. Propionibacterium strain, performing 
a polymerase chain reaction under conditions which bring about amplification of the 
desired region, isolating the amplified fragment (e.g. by purifying the reaction 
mixture on an agarose gel) and recovering the amplified DNA. The primers may be 
designed to contain suitable restriction enzyme recognition sites so that the amplified 

10 DNA can be cloned into a suitable cloning vector. 

Such techniques may be used to obtain all or part of the synthase or transferase 
sequence described herein. Genomic clones corresponding to the cDNA of SEQ ID 
No. 1, 3, 5 or 7 or the synthase or transferase gene containing,, for example, introns 
and promoter regions are within the invention also and may also be obtained in an 

15 analogous manner (e.g! recombinant means, PCR, cloning techniques), starting with 
genomic DNA from a fungal, yeast, bacterial plant or prokaryotic cell. 

The polynucleotides or primers may carry a revealing label, e.g. a radioactive or 
non-radioactive label. Suitable labels include radioisotopes such as 32 P or 35 S, 
enzyme labels, or other protein labels such as biotin. Such labels may be added to 

20 polynucleotides or primers of the invention and may. be detected using techniques 
known per se. 

Polynucleotides, labelled or unlabelled may be used in nucleic acid-based tests 
for detecting or sequencing a synthase or transferase or a variant thereof in a (e.g. 
bacterial) sample. Such tests for detecting generally comprise bringing a (e.g. 

25 bacterial) sample (suspected of) containing DNA into contact with a probe or primer 
of the invention under hybridizing conditions and detecting any duplex formed 
between the probe and nucleic acid in the sample. Such detection may be achieved 
using techniques such as PCR or by immobilizing the probe on a solid support, 
removing nucleic acid in the sample which is not hybridized to the probe, and then 

30 detecting nucleic acid which was hybridized to the probe. Alternatively, the sample 
nucleic acid may be immobilized on a solid support, and the amount of probe bound 
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to such a support can be detected. 

The probes of the. invention may conveniently be packaged in the form of a test 
kit in a suitable container. In such kits the probe may be bound to a solid support 
where the assay format for which the kit is designed requires such binding. The kit 
5 may also contain suitable reagents for treating the sample to be probed, hybridizing ' 
the probe to nucleic acid in the sample, control reagents, instructions, and the like. 

Preferably, the polynucleotide of the invention is obtainable from the same 
organism as the polypeptide, such as a bacteria, in particular a bacteria of the family 
Mycobacteriaceae, preferably of the genus Propionibacterium. 
10 The polynucleotides of the invention also include variants of the sequence of 

SEQ ID No. 1, 3, 5 or 7 which have synthase or transferase activity. Variants may be 
formed by additions, substitutions and/or deletions and may have the ability to act as 
a synthase or transferase, or have the activity EC 6.3.1-, 2.7.7 (or 8).- or 2.5.1.17. 

15 Production of polynucleotides 

Polynucleotides which do not have 100% identity with SEQ ID No'. 1, 3, 5 or 7 
but fall within the scope of the invention can be obtained in a number of ways. Thus 
variants of the sequences described herein may be obtained for example by probing 

20- genomic DNA libraries made from a range of organisms, for example those 

discussed as sources of the polypeptides of the invention. In addition, other bacterial 
or prokaryotic homologues may be obtained and such homologues and fragments 
thereof in general will be capable of hybridising to SEQ ID No. 1, 3, 5 or 7. Such 
sequences may be obtained by probing cDNA libraries or genomic DNA libraries 

25 from other species, and probing such libraries with probes comprising all or part of 
SEQ ID. 1, 3, 5 or 7 under conditions of medium to high stringency (as described 
earlier). Nucleic acid probes comprising all or part of SEQ ID No. 1, 3, 5 or 7 may 
be used to probe cDNA libraries from other species, such as those described as 
sources for the polypeptides of the invention. 

30 Species homologues may also be obtained using degenerate PCR which will use 

primers designed to target sequences within the variants and homologues encoding 
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conserved amino acid sequences. The primers can contain one or more degenerate 
positions and will be used at stringency conditions lower than those used for cloning 
sequences with single sequence primers against known sequences. 

Alternatively, such polynucleotides may be obtained by site directed 
5 mutagenesis of the 

synthase or transferase sequences or variants thereof. This may be useful where for 
example silent codon changes are required to sequences to optimise codon 
preferences for a particular host cell in which the polynucleotide sequences are being 
expressed. Other sequence changes may be desired in order to introduce restriction 
10 enzyme recognition sites, or to alter the property or function of the polypeptides 
encoded by the polynucleotides. 

The invention includes double stranded polynucleotides comprising a 
polynucleotide of the invention and its complement. 

The present invention also provides polynucleotides encoding the polypeptides 
15 of the invention described below. Since such polynucleotides will be useful as 
sequences for recombinant production of polypeptides of the invention, it is not 
necessary for them to be capable of hybridising to the sequence of SEQ ID No. 1, 3, 5 
or 7, although this will generally be desirable. Otherwise, such polynucleotides may 
be labelled, used, and made as described above if desired. 

20 

B. Polypeptides 

The present invention relates to an (e.g. (substantially) purified and/or isolated) 
synthase or transferase or variant thereof as defined later. The polypeptides of the 
25 invention may consist essentially of the amino acid sequence of SEQ ID No. 2, 4, 6 
or 8 or of a variant of that sequence. Polypeptides may also be encoded by a 
polynucleotide of the invention as described above. 

A polypeptide of the invention may be in an isolated or a substantially purified 
form. It will be understood that the polypeptide may be mixed with carriers or 
30 diluents which will not interfere with the intended purpose and/of function of the 

polypeptide and still be regarded as substantially isolated. It will generally comprise 
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the polypeptide in a preparation in which more than 20%, e.g. more than 30%, 40%, 
50%, 80%, 90%, 95% or 99%, by weight of the polypeptide in the preparation is a 
polypeptide of the invention. Routine methods can be employed to purify and/or 
synthesise the proteins according to the invention 1 . For some formulations (e.g. for 
5 non-pharmaceutical uses) the amount of polypeptide present may be small, for. 
example from 0.01 to 10%, such as from 0.1 to 5%, or 2% or even from 0.2 to 1%. 

Preferably, the polypeptide of the invention is obtainable from a microorganism, 
for example one which possesses a gene encoding an enzyme with synthase or 
transferase activity. More preferably the microorganism is bacterial, such as Gram 

10 positive bacteria. The microrganism may be of the Phylum or class Actinobacteria, 
e.g.Subclass Adinobacteridae. Preferably the microorganism is of Order 
Actinomycetales, such as Sub order Propionibacterineae and optimally from the 
family Propionibacteriaceae. Preferred organisms are thus of the genus 
Propionibacterium, such as of the species Propionibacterium frendenreichii. 

15 Preferably the microorganism is capable of producing or synthesising vitamin B 1 2. 



Activity 



A polypeptide of the invention can have one or more of the following features, 
20 namely it: 

(1) possesses synthase or transferase activity; 

(2) acts as an amide synthase or a phospho-, nucleotidyl- or aryl-transferase; 

(3) catalyses at least one step in the vitamin B l2 biosynthetic pathway; 

(4) has an activity within EC 6.3.1-, EC 2.7.7-, EC 2.7.8- or EC 2.5.1.17; 

25 (5) has a length of 150 or 170 to 270 or 300 amino acids or from 800 or 840 to 

880 or 920 amino acids; 
(6) is a cobyrinic acid -a,c-diamide synthase, a cobinamide kinase, a 

cobinamide phosphate guanyl transferase, a cobalamin (5 '-phosphate) 

synthase and/or an adenosyl transferase; 
30 (7) acts on a substrate, or produces a product, which comprises: 

(i) a corrin core or ring system; 
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(ii) up to 4 aryl, optionally pyrrole, rings; 

(iii) a tetrapyrrole ring system and/or a transition metal (e.g. cobalt) atom; 

and/or 

(iv) an amide, phosphate, guanidyl, aryl or adenosyl moiety or group; 

5 and/or 

(8) catalyses amidation, phosphorylation, nucleotidylation, arylation, ribazole 

or adenosyl addition and/or adenosylation. 
A table of the main characteristics of the polypeptides of the invention is given 
below. 
10 . " 



Desig 
nation 
/ 

reacti 
on 


DNA 

seq 

No. 

(length, 

nucs) 


Protein 

seq 

No. 


Seq 
length 
(amino 
acids) 


Enzyme activity/ 
activities 


Substrate 


Product 
(formula) 


Enzyme 
class/ type 


Gene 
refere 
No. (. 
name] 


A 


1 

(2586) 


2 


861 


cobyrinic acid 
a,c diamide 
synthase 
(EC 6.6.3.-) 


Cobyrinic 
acid© 


Cobyrinic 
a,c diamide 
(IB) (with 
intermediate 
cobyrinic acid 
c-amide, IA) 


(amide) 
synthase. 


PER: 

(cobA 


B(B1, 
B2) 








cobinamide kinase 
(EC 2.7.1.-) 


Adenosyl 
cobinamide 

(n) 


Adenosinyl 
cobinamide 
phosphate (HA) 


(phospho) 
transferase 


PFR 3 
(cobt 


3 

(657) 


4 


218 


cobinamide 
phosphate guanyl 
transferase 
(EC 2.7.7.-) 


Adenosyl 

cobinamide 

phosphate 

(ha) 


Adenosyl-GDP- 
cobamide (HB) 


(nucleotidyl 
) transferase 


C 


5 

(780) 


6 


256 . 


cobalamin (5*- 
phosphate) 
synthase 
(EC 2.7.8.-) 


Adenosyl- 
GDP- 
cobamide 
(OB) 


Adenosyl-5,6- 

dimethyl 

benzimidazolyl 

cobamide, 

(adenosyl 

cobalamin i.e. 

vitamin B 12 (IIC) 


(aryl) 

transferase 


PFR 1 
(cobS) 


D 


7 

(603) 


8 


200 


(cob(l)alamin) 
adenosyl transferase 
(EC 2.5.1.17) 


Cobyrinic 
acid a, c 
diamide 
OB) 


Adenosyl 
cobyrinic acid 
a, c diamide (IC) 


(nucleotidyl 
) transferase 


PFR1 



15 



Variants and Homologues 

A polypeptide of the invention can comprise the amino acid sequence set out in 
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SEQ ID No. 2, 4, 6 or 8 (or a variant thereof, such as) a substantially homologous 
sequence or a fragment of either sequence and can have synthase or transferase 
activity. In general, the naturally occurring amino acid sequence shown in SEQ ID 
No. 2, 4, 6 or 8 is preferred. 
5 In particular, the polypeptide of the invention may comprise: 

(a) the polypeptide sequence of SEQ ID No. 2, 4, 6 or 8; 

(b) a naturally occurring variant or species homolog, paralog or ortholog 
thereof; or 

(c) a protein with at least 70, at least 80, at least 90, at least 95, at least 98 or at 
10 least 99% sequence identity to (a) or (b). 

A variant may be one that occurs naturally, for .example in fungal, bacteria, yeast 
or plant cells and which can function in a substantially similar manner to the protein 
of SEQ ID No. 2, 4, 6 or 8, for example it has synthase or transferase activity. 
Similarly a species homolog of the protein will be the equivalent protein which 

15 occurs naturally in another species and which can function as a synthase or 

transferase enzyme. Variants include allelic variants either from the same strain as 
the polypeptide of the invention or from a different strain, but of the same, genus, or 
of the same species. 

Variants and species homologues can be obtained by following the procedures 

20 described herein for the production of the polypeptide of SEQ ID No. 2, 4, 6 or 8 and 
performing such procedures on a suitable cell source, for example a bacterial, yeast, 
fungal or plant cell. It will also be.possible to use a probe as defined above to probe 
libraries made from yeast, bacterial, fungal or plant cells in order to obtain clones 
including the variants or species homology. The clones can be manipulated by 

25 conventional techniques to generate a polypeptide of the invention which can then be 
produced by recombinant or synthetic techniques known per se. 

The polypeptide of the invention preferably has at least 70% sequence identity to 
any of the proteins of SEQ ID No. 2, 4, 6 or 8, more preferably at least 80%, at least 
90%, at least 95%, at east 97% or at least 99% sequence identity thereto, for example . 

30 for each sequence over a region of at least 60, at least 100, 150, 200, 250 or 300 (or 
even 500, 600, 700 or 800) contiguous amino acids or over the full length of SEQ ID 
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No. 2, 4, 6 or 8. For individual sequences the sequence identity may be: 

(a) for SEQ ID No. 2, at least 55%, 60% or 65%; 

(b) for SEQ ID No. 4, at least 50%, 55% or 60%; 

(c) for SEQ ID No. 6, at least 40%, 45% or 50%; and/or 

5 (d) for SEQ ID No/8, at least 90%, 95%, 98% or 99% (e.g. over, or the 

sequence being longer than, at least 150, 170, 200 or 230 amino 
acids). 

The sequence of the polypeptide of SEQ ID No. 2, 4, 6 or 8 and of variants and 
species homo logs can thus be modified to provide polypeptides of the invention. 

10 Amino acid substitutions may be made, for example from or up to 1, 2 or 3 to 10, 20, 
30, 50 or 100 substitutions. The same number of deletions and insertions may also 
be made. These changes may be made outside regions critical to the function of the 
polypeptide and so may still result in an active enzyme. The modified polypeptide 
generally retains activity as a synthase or transferase. 

15 Polypeptides of the invention include fragments of the above mentioned full 

length polypeptides and of variants thereof, including fragments of the sequence set 
out in SEQ ID No. 2, 4, 6 or 8. Such fragments typically retain activity as a synthase 
or transferase. Fragments maybe at least 50, 100, 150, 200 or 250 amino acids long 
or may be this number of amino acids short of the full length sequence (as shown in 

20 SEQ ID No. 2, 4, 6 or 8). 

Polypeptides of the invention can if necessary be produced by synthetic means 
- although usually they will be made recombinantly as described below. They may be 
modified for example by the addition of histidine residues or a T7 tag to assist their 
identification or purification or by the addition of a signal sequence to promote their 

25 secretion from a cell. 

The term "variants 11 refers to polypeptides which have the same essential 
character or basic biological functionality as the synthase or transferase, and include 
allelic variants. The essential character of synthase is that it is an enzyme that 
exhibits EC 6.3.-.-. (e.g. EC 6.3.1.-.) activity or can add an amine group to a substrate 

30 (e.g. amidate). For a transferase, this is an enzyme that exhibits EC 2.7.-.-. (e.g. EC 
2.7.7 or 8.-) or EC 2.5.-.-. (e.g. EC 2.5.1.-, such as EC 2.5.1.17) activity and can 
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transfer a substituent or chemical moiety from one compound to another. Preferably 
a variant polypeptide has the same activity. A polypeptide having the same essential 
character as may be identified by performing a substrate degradation assay. 

Variants of SEQ ID No. 2, 4, 6 or 8 also include sequences which vary from 
5 SEQ ID No. 2, 4, 6 or 8 but which are not necessarily derived from the naturally 
occurring enzyme. These variants may be described as having a.percentage 
- homology to SEQ ID No. 2, 4, 6 or 8 or having a number of substitutions within this 
sequence. Alternatively a variant may be encoded by a polynucleotide which 
hybridizes to SEQ ID No. 1, 3, 5 or 7. • 

10 The variants can be defined in a similar manner to the variants of SEQ ID No. 1 , 

3, 5 or 7. Thus the variants may comprise variant sequences derived from other 
bacterial strains, e.g. Propionibacterium. Other variants can be identified from other 
strains by looking for synthase or transferase, activity and cloning and sequencing as 
before. Variants may include the deletion, modification or addition of single amino 

15 acids or groups of amino acids within the protein sequence, as long as the peptide 
maintains the basic biological functionality, e.g. a synthase or transferase. 

Conservative substitutions may be made, for example according to the following 
Table. Amino acids in the same block in the second column and preferably in the 
same line in the third column may be substituted for each other. Preferably 

20 substitutions do not affect the folding or activity of the polypeptide. 



ALIPHATIC 


Non-polar 


GAP 






ILV 




Polar-uncharged 


CS TM 






NQ 




Polar-charged 


DE 






KR 


AROMATIC . 




HF WY 



Shorter polypeptide sequences are within the scope of the invention. For 
example, a peptide of at least 50 amino acids or up to 60, 70, 80, 100, 150, 200, 400, 
500, 600 or 700 amino acids in length is considered to fall within the scope of the 
25 invention as long as it demonstrates the basic biological functionality of the synthase 
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or transferase. In particular, but not exclusively, this aspect of the invention 
encompasses the situation when the protein is a fragment of the complete protein 
sequence and may comprise or represent a substrate binding region, cleaving and/or 
transferring region. 

5 

Modifications . 

. Polypeptides of the invention may be chemically modified, e.g. post- 
translationally 

10 . modified. For example, they may comprise modified amino acid residues. They may 
also be modified by the addition of histidine residues (to assist their purification) or 
by the addition of a signal sequence (to promote insertion into the cell membrane). 
The polypeptide may have one or more (N) amino- or (C) carboxyl-terminal 
extensions, such as an ammo-terminal methionine residue, a small linker peptide of 

15 up to about 20-25 residues, or a (small) extension that facilitates purification, such as 
a poly-histidine or T7 tag, an antigenic epitope or a (e.g. maltose) binding domain 14 
(e.g. at the C-terminus). These extensions may or may not be added via a linker. 

A polypeptide of the invention may be labelled with a revealing label. The 
revealing label may be any suitable label which allows the polypeptide to be detected. 

20 Suitable labels include radioisotopes, e.g. l25 I, 35 S, enzymes, antibodies, 
polynucleotides and linkers such as biotin. 

The polypeptide may be modified to include non-naturally occurring amino 
acids or to increase the stability of the polypeptide. When the peptide is produced by 
synthetic means, such amino acids may be introduced during production. The 

25 peptide may also be modified following either synthetic or recombinant production. 

The polypeptides of the invention may also be produced using, or comprise, (one 
or more) D-amino acids. 

A number of side chain modifications are known in the art and maybe made to 
the side chains of the proteins or peptides of the present invention. Such 

30 modifications include, for example, modifications of amino acids by reductive 
alkylation by reaction with an aldehyde followed by reduction with NaBHU, 
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amidination with methylacetimidate or acylation with acetic anhydride. 

The sequences provided by the present invention may also be used as starting 
materials for the construction of "second generation" enzymes. "Second generation" 
enzymes are ones altered by mutagenesis techniques (e.g. site-directed mutagenesis), 
5 which have properties that differ from those of the wild-type or recombinant enzymes 
such as those produced by the present invention. For example, the temperature or pH 
optimum, specific activity, substrate affinity or thermostability maybe altered so as 
to be better suited for application in a defined process. 

Amino acids essential to activity, and therefore preferably subject to 

10 substitution, may be identified according to procedures known in the art, such as site- 
directed mutagenesis or alanine-scanning mutagenesis 10 . In the latter technique 
mutations are introduced at every residue in the molecule, and the resultant mutant 
molecules are tested for biological activity (e.g. synthase or transferase activity) to 
identify amino acid residues that are critical to the activity of the molecule. Sites of 

15 enzyme-substrate interaction can also be determined by analysis of crystal structure 
as determined by such techniques as nuclear magnetic resonance, crystallography or 
photo-affinity labelling 1 1,12,13 or molecular modelling. 

The use of yeast and fungal host cells may provide any post-translational 
modifications (e.g. proteolytic processing, myristilation, glycosylation, truncation, 

20 and tyrosine, serine or threonine phosphorylation) influencing biological activity on 
recombinant expression products of the invention. 

Polypeptides of the invention may be provided in a form such that they are 
outside their natural cellular environment. Thus, they may be substantially isolated 
or purified, as discussed abovei or in a cell in which they do not occur in nature, e.g. 

25 a cell of other bacterial species, animals, yeast or fungi. 

C. Recombinant Aspects . 



30 



The invention also provides vectors comprising a polynucleotide of the 
invention, including cloning and expression vectors, and methods of growing, 
transforming or transfecting such vectors in % suitable host cell, for example under 
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conditions in which expression of a polypeptide of the invention occurs. Provided 
also are host cells comprising a polynucleotide or vector of the invention wherein the 
polynucleotide is heterologous to the genome of the host cell. The term 
"heterologous", usually with respect to the host cell, means that the polynucleotide 
5 does not naturally occur in the genome of the host cell or that the polypeptide is not 
naturally produced by that cell. Preferably, the host cell is a bacterial cell, for 
example a (e.g. Gram positive) cell of the family Propionibacteriaceae. 

Polynucleotides of the invention can be incorporated into a recombinant 
replicable vector, for example a cloning or expression vector. The vector may be 

10 used to replicate the nucleic acid in a compatible host cell. Thus in a further 

embodiment, the invention provides a method of making polynucleotides of the 
invention by introducing a polynucleotide of the invention into a replicable vector, 
introducing the vector into a compatible host cell, and growing the host cell under 
conditions which bring about replication of the vector. The vector may be recovered 

15 from the host cell. Suitable host cells are described below in connection with 
expression vectors. 

Vectors 

20 The polynucleotide of the invention may be inserted into an expression cassette. 

The vector into which the expression cassette or polynucleotide of the invention is 
inserted may be any vector which may conveniently be subjected to recombinant 
DNA procedures, and the choice of the vector will often depend on the host cell into 
which it is to be introduced. Thus, the vector may be an autonomously replicating 

25 vector, i.e. a vector which exists as an extra-chromosomal entity, the replication of 
which is independent of chromosomal replication, e.g. a plasmid. Alternatively, the 
vector may be one which, when introduced into a host cell, is integrated into the host 
cell genome and replicated together with the chromosome(s) into which it has been 
integrated. 

30 Preferably, a polynucleotide of the invention in a vector is operably linked to a 

regulatory sequence which is capable of providing for the expression of the coding 
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sequence by the host cell, i.e. the vector is an expression vector. The term "operably 
linked" refers to a juxtaposition wherein the components described are in a 
relationship pennitting them to function in their intended manner. A regulatory 
sequence such as a promoter, enhancer or other expression regulation signal 
5 "operably linked" to a coding sequence is positioned in such a way that expression of 
the coding sequence is achieved under condition compatible with the control 
. sequences. 

The vector may be a plasmid, cosmid, virus or phage vector, usually provided 
with an origin of replication, optionally a promoter for the expression of the 

10 polynucleotide and optionally an enhancer and/or a regulator of the promoter. A 
terminator sequence may be present, as may be a polyadenylation sequence. The 
vector may contain one or more selectable marker geiies, for example an ampicillin 
resistance gene (in the case of a bacterial plasmid) or a neomycin resistance gene (for 
a mammalian vector). Vectors may be used in vitro, for example for the production 

15 of RNA or used to transfect or transform a host cell. They may comprise two or 
more polynucleotides of the invention, for example for overexpression. 

The vector may comprise two, three or all four polynucleotides of the invention, 
in other words at least two, three or four polynucleotide sequences that encode the 
four polypeptides of the invention (SEQ ID Nos. 2, 4, 6 and 8, or variants (fragments 

20 or substantially homologous sequences thereof), as defined earlier). Preferred 

combinations include the sequences encoding protein B (cobU) and protein C (cobS), 
optionally with the third protein A (cobA/cbiA). Thus, the vector may comprise SEQ 
ID Nos. 1, 3, 5 and/or 7, or fragments thereof, or sequences that hybridize thereto, as 
defined earlier in the specification. 

25 Preferably at least 2, 3 and (optimally) 4 of the polynucleotides are in the same 

operon, for example operon C. They may be arranged so that the vector (or eventual 
host) comprises an operon, or sequence, comprising sequences encoding one or more 
of the following enzymes, in the order: (nucleotidyl, transferase, (amide) synthase, 
(phospho) transferase and/or (nucleotidyl) transferase, (aryl) transferase. Thus, if all 

30 four polynucleotides are present, then the order in which they are arranged is 
preferably SEQ ID. Nos. 7,1,3, 5 (or variants of these sequences , as previously 
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defined). 

The DNA sequence encoding the polypeptide is preferably introduced into a 
suitable host as part of an expression cassette (or construct) in which the DNA 
sequence is operably linked to expression signals which are capable of directing 
5 expression of the DNA sequence in the host cells. For transformation of the suitable 
host with the expression construct transformation procedures are available which are 
well known to* the skilled person 3,4 . The expression construct can be used for 
transformation of the host as part of a vector carrying a selectable marker, or the 
expression construct may he co-transformed as a separate molecule together with the 
10 vector carrying a selectable marker. - The vector may comprise one or more selectable 
marker genes. 

Preferred selectable markers 15,16 include but are not limited to those that 
complement a defect in the host cell or confer resistance to a drug. They include e.g. 
versatile marker genes that can be used for transformation of bacteria (e.g. E. coli), 

15 most filamentous fungi and yeasts such as acetamidase genes or cDNAs (the a'mdS, 
niaD,facA genes or cDNAs from A. nidulans, A. oryzae, or A.niger), or genes 
providing resistance to antibiotics like G418, hygromycin, bleomycin, kanamycin, 
phleomycin or benomyl resistance (benA). Alternatively, specific selection markers 
can be used such as auxotrophic markers which require corresponding mutant host 

20 strains: e.g. URA3 (from S.cerevisiae or analogous genes from other yeasts), pyrG or 
pyrA (from A.nidulans or A.niger), argB (from A.nidulans or A.niger) or trpC. In a 
preferred embodiment the selection marker is deleted from the transformed host cell 
after introduction of the expression construct so as to obtain transformed host cells 
capable of producing the polypeptide which are free of selection marker genes 21,22 . 

25 Other markers include ATP synthetase, subunit 9 (o/zC), orotidine-S'-phosphate- 

decarboxylase (pvrA), the bacterial G418 resistance gene (this may also be used in 
yeast, but not in fungi), the ampicillin resistance gene (E. coif), the neomycin 
resistance gene (Bacillus) and the E. coli uidA gene, coding for (^glucuronidase 
(GUS). Vectors may be used in vitro, for example for the production of RNA or used 

30 to transfect or transform a host cell. 

For most (filamentous) fungi, yeast or bacteria, the vector or expression 
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construct is preferably integrated, such as in the genome of the host cell in order to 
obtain stable transformants. However, for certain yeasts also suitable episomal 
vectors are available into which the expression construct can be incorporated for 
stable and high level expression, examples thereof include vectors derived from the 
5 2\x and pKDl plasmids of Saccharomyces and Kluyveromyces, respectively, or 

vectors containing an AMA sequence (e.g. AMA1 from Aspergillus 3 * 0 ). Ia case the 
expression constructs are integrated in the host cell's genome, the constructs are 
either integrated at random loci in the genome, or at predetermined target loci using 
homologous recombination, in which case the target loci preferably comprise a 

10 highly expressed gene. A highly expressed gene is a gene whose mRNA can make up 
at least 0.01% (w/w) of the total cellular mRNA, e.g. under induced conditions, or 
alternatively, a gene whose gene product can make up at least 0.2% (w/w) of the total 
cellular protein (or, in case of a secreted gene.product, can be secreted to a level of at 
least 0.05g/l). A number of examples of suitable highly expressed genes are provided 

15 below. 

A vector or expression construct for a given host cell may comprise the 
following elements operably linked to each other in a consecutive order from the 5'- 
end to 

3 f -end relative to the coding strand of the sequence encoding the polypeptide of the 
20 first invention: 

(1) a promoter sequence capable of directing transcription of the DNA 
sequence encoding the polypeptide in the given host cell; 

(2) optionally, a signal sequence capable of directing secretion of the 
polypeptide from the given host cell into a culture medium; 

25 (3) a DNA sequence encoding a mature and preferably active form of the 

polypeptide; and preferably also 
(4) a transcription termination region (terminator) capable of terminating 

transcription downstream of the DNA sequence encoding the polypeptide. 
Downstream of the DNA sequence encoding the polypeptide there may be a 3' 
30 untranslated region containing one or more transcription termination sites (e.g. a 
terminator). The origin of the terminator is less critical. The terminator can e.g. be 
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native to the DNA sequence encoding the polypeptide. However, preferably a yeast 
terminator is used in yeast host cells, a filamentous fungal terminator is used in 
filamentous fungal host cells and a bacterial terminator in bacterial cells. More 
preferably, the terminator is endogenous to the host cell (in which the DNA sequence 
5 encoding the polypeptide is to be expressed). 

Enhanced expression of the polynucleotide encoding the polypeptide of the 
invention may also be achieved by the selection of heterologous regulatory regions, 
e.g. promoter, and/or terminator regions, which may serve to increase expression and, 
if desired, secretion levels of the protein of interest from the expression host and/or 
10 to provide for the inducible control of the expression of the polypeptide of the 
invention. 

Aside from the promoter native to the gene encoding the polypeptide of the 
invention, other promoters may be used to direct expression of the polypeptide of the 
invention. The promoter may be selected for its efficiency in directing the expression 

15 of the polypeptide of the invention in the desired expression host. 

Promoters/enhancers and other expression regulation signals may be selected to 
. be compatible with the host cell for which the expression vector is designed. For 
example prokaryotic promoters may be used, in particular those suitable for use in 
E.coli strains. When expression is carried out in mammalian cells, .mammalian 

20 promoters may be used. Tissues-specific promoters, for example hepatocyte cell- 
specific promoters, may also be used. Viral promoters may also be used, for example 
the Moloney murine leukaemia virus long terminal repeat (MMLV LTR), promoter 
rous sarcoma virus (RSV) LTR promoter, SV40 (e.g. large T antigen) promoter, 
human cytomegalovirus (CMV) IE promoter, heipes simplex virus promoters or 

25 adenovirus promoters, HS V promoters such as the HS V IE promoters), or HPV 

promoters, particularly the HPV upstream regulatory region (URR).Yeast promoters 
include S. cerevisiae GAL4 and ADH promoters, the S. pornbe nmt 1 and adh 
promoter. Mammalian promoters include the metallothionein promoter which may be 
induced in response to heavy metals such as cadmium and B-actin promoters. Tissue- 

30 specific promoters, in particular endothelial or neuronal cell specific promoters (for 
. example the DDAffl and DDAHU promoters), are especially preferred. 
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A variety of promoters 15,16 can be used that are capable of directing transcription 
in the host cells of the invention. Preferably the promoter sequence is derived from a 
highly expressed gene as- previously defined. Examples of preferred highly expressed 
genes from which promoters are preferably derived and/or which are comprised in 
5 preferred predetermined target loci for integration of expression constructs, include 
but are not limited to genes encoding glycolytic enzymes such as triose-phosphate 
isomerases (TPI), glyceraldehyde-phosphate dehydrogenases (GAPDH), 
phosphoglycerate kinases (PGK), pyruvate kinases (PYK), alcohol dehydrogenases 
(ADH), as well as genes encoding amylases, gluco amylases, proteases, xylanases, 

10 cellobiohydrolases, fl-galactosidases, alcohol (methanol) oxidases, elongation factors 
and ribosomal proteins. Specific examples of suitable highly expressed genes include 
e.g. the LAC4 gene from Kluyveromyces sp., the methanol oxidase genes (AOX and 
MOX) from Hansenula and Pichia, respectively, the glucoamylase (glaA) genes from 
A.niger and A.awamori, the A.oryzae TAKA-amylase gene, the A.nidulans gpdA 

15 gene and the T.reesei cellobiohydrolase genes. 

Examples of strong constitutive and/or inducible promoters which are preferred 
for use in fungal expression hosts 15,16 are those which are obtainable from the fungal 
genes for xylanase (xlnA), phytase, ATP-synthetase, subunit 9 (o/z'C), triose 
phosphate isomerase (tpi\ alcohol dehydrogenase (AdhA), a-amylase {amy), 

20 amyloglucosidase (AG - from the glaA gene), acetamidase (amdS) and 
glyceraldehyde-3-phosphate dehydrogenase (gpd) promoters. 

Examples of strong yeast promoters are those obtainable from the genes for 
alcohol dehydrogenase, lactase, 3-phosphoglycerate kinase and triosephosphate 
isomerase. 

25 Examples of strong bacterial promoters are the a-amylase and SPo2 promoters 

as well as promoters from extracellular protease genes. 

Promoters suitable for plant cells include napaline synthase (nos), octopine 
synthase (ocs), maonopine synthase (mas), ribulose small subunit (rubisco ssu), 
histone, rice actin, phaseolin, cauliflower mosaic virus (CMV) 35S and 19S and 
30 circovirus promoters. All these promoters are readily available in the art. 

The vector may further include sequences flanking the polynucleotide giving rise 
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to RNA which comprise -sequences homologous to eukaryotic genomic sequences, 
preferably mammalian genomic sequences, or viral genomic sequences. This will 
allow the introduction of the polynucleotides of the invention into the genome of 
eukaryotic cells or viruses by homologous recombination. In particular, a plasmid 
5 vector comprising the expression cassette flanked by viral sequences can be used to 
prepare a viral vector suitable for. delivering the polynucleotides of the invention to a 
mammalian cell. Other examples of suitable viral vectors include herpes simplex 
viral vectors 18,19 and retroviruses, including lentiviruses, adenoviruses, adeno- 
associated viruses and HPV viruses (such as HPV-1 6 or HPV-1 8). Gene transfer 

10 techniques using these viruses are known to those skilled in the art. Retrovirus 

vectors for example may be used to stably integrate the polynucleotide giving rise to 
the antisense RNA into the host genome. Replication-defective adenovirus vectors 
by contrast remain episomal and therefore allow transient expression. 

The vector may contain a polynucleotide of the invention oriented in an 

15 antisense direction to provide for the production of antisense RNA. This may be 
used to reduce, if desirable, the levels of expression of the polypeptide. 

For bacteria, specialised vectors can be employed, for example an 
expression vector or a plasmid. Suitable vectors and expression systems for 
Propionibacteria are known in the art 27 ' 28, 30 . For example one can use a plasmid 

20 from another Propionibacterium species, such as P. acidipropionici. This plasmid 
can be used to prepare a shuttle vector (such as pPK705) containing one or more of 
the six open reading frames of the P. acidipropionici plasmid. The vector may 
contain a drug marker, such as a hygromycin B resistant gene. This vector has been 
able to successfully transform Propionibacterium freudenreichii subspecies 

25 shermanii. Transformation may be by electroporation. 

Several promoters particularly suitable for Propionibacterium can be employed, 
in particular from Propionibacterium freudenreichii subspecies shermanii. These 
include the Propionibacterium bacterium promoters P4 and PI 38. 

Alternatively or in addition, one may use one or more endogenous plasmids of 

30 Propionibacterium, or vectors derived from such a plasmid, in order to express a 

preferably heterologous protein in bacteria. Such plasmids and vectors are known in 
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the art 29 and maybe based on plasmids from Propionibacterium bacterium 
LMG16545 (deposited under accession nos. CBS 101022 and CBS 101023). 

Host cells and Expression 
5 In a further aspect the invention provides a process for preparing a polypeptide 

according to the invention which comprises cultivating a host cell (e.g. transformed 
or transfected with an expression vector as described above) under conditions to 
provide for expression (by the vector) of a coding sequence encoding the 
polypeptide, and optionally recovering the expressed polypeptide. Polynucleotides of 

10 the invention can be incorporated into a recombinant replicable vector, e.g. an 
expression vector. The vector may be used to replicate the nucleic acid in a 
compatible host cell. It may contain at least one copy (such as multiple copies) of the 
polynucleotide of the invention. Thus in a further embodiment, the invention 
provides a method of making a polynucleotide of the invention by introducing a 

15 polynucleotide of the invention into a replicable vector, introducing the vector into a 
compatible host cell, and growing the host cell under conditions which bring about 
the replication of the vector. The vector may be recovered from the host cell. 
Suitable host cells include bacteria, preferably Gram positive e.g. of the family 
Propionibacteriaceae. Others include E. coli, yeast, mammalian cell lines and other 

20 eukaryotic cell lines, for example insect cells such as Sf9 cells and (e.g. filamentous) 
fungal cells. 

The polypeptide can be produced as a (secreted) protein in which case the DNA 
sequence encoding a mature form of the polypeptide in the expression construct is 
operably linked to a DNA sequence encoding a signal sequence. Preferably the signal 
. 25 sequence is native (homologous) to the DNA sequence encoding the polypeptide. 
Alternatively the signal sequence is foreign (heterologous) to the DNA sequence 
encoding the polypeptide, in which case the signal sequence is preferably endogenous 
to the host cell in which the DNA sequence is expressed. Examples of suitable signal 
sequences for yeast host cells are the signal sequences derived from yeast a-factor 
30 genes. Similarly, a suitable signal sequence for filamentous fungal host cells is e.g. a 
signal sequence derived from a filamentous fungal amyloglucosidase (AG) gene, e.g. 
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the A.niger glaA gene. This may be used in combination with the amyloglucosidase 
(also called (gluco)amylase) promoter itself, as well as in combination with other 
promoters. Hybrid signal sequences may also be used with the context of the present 
invention. 

5 Suitable heterologous secretion leader sequences are those originating from the 

fungal amyloglucosidase (AG) gene (glaA ~ both 18 and 24 amino acid versions e.g. 
from Aspergillus), the a-factor gene (yeasts e.g. Saccharomyces and Kluyveromyces) 
or the a-amylase gene (Bacillus). 

The vectors may be transformed or transfected into a suitable host cell as 

**■ 

10 described above to provide for expression of a polypeptide of the invention or for 
production of vitamin B12. This process may comprise culturing a host cell 
transformed with an expression vector as described above under conditions to 
provide for expression by the vector of a coding sequence encoding the polypeptide. 
A further aspect of the invention thus provides host cells transformed or 

15 transfected with or comprising a polynucleotide or vector of the invention. 

Preferably the polynucleotide is carried in a vector for the replication and expression 
of the polynucleotide. The cells will be chosen to be compatible with the said vector 
and may for example be prokaryotic (for example bacterial), fungal, yeast or plant 
cells. 

20 The invention encompasses processes for the production of the polypeptide of 

the invention by means of recombinant expression of a DNA sequence encoding the 
polypeptide. For this purpose the DNA sequence of the invention can be used for 
gene amplification and/or exchange of expression signals, such as promoters, 
secretion signal sequences, in order to allow economic production of the polypeptide 

25 in a suitable homologous or heterologous host cell. A homologous host cell is herein 
defined as a host cell which is of the same species or which is a variant within the 
same species as the species from which the DNA sequence is derived. 

Suitable host cells can be prokaryotic microorganisms such as bacteria, or 
eukaryotic organisms, for example fungi, such as yeasts or filamentous fungi, or plant 

30 cells. In general, yeast cells are preferred over fungal cells because they are easier to 
manipulate. However, some proteins are either poorly secreted from yeasts, or in 
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some cases are not processed properly (e.g. hyperglycosylation in yeast). In these 
instances, a fungal or bacterial host organism can be selected. For production of 
vitamin Bi 2 , a prokaryotic or bacterial host is preferred. 

The host cell may over-express the polypeptide, and techniques for engineering 
5 over-expression are well known 3 . The host may thus have two or more copies of the 
encoding polynucleotide (and the vector may thus have two or more copies 
accordingly). 

Bacteria from the genus Bacillus are. suitable heterologous hosts because of 
their capability to secrete proteins into the culture medium. Other bacteria suitable as 
10 hosts are those from the genera Streptomyces and Pseudomonas. However, preferably 
the host is from the same order (e.g. Actinomycetales) or family 
(Propionibacteriaceae) as the bacteria from which the polynucleotides of the 
invention can be obtained from (P.Jreudenreichii). 

A preferred yeast host cell for the expression of the DNA sequence encoding the 
15 polypeptide is of the genera Saccharomyces, Kluyveromyces, Hansenula, Pichia, 
Yarrowia, and Schizosaccharomyces, More preferably a yeast host cell is selected 
from the group consisting of the species Saccharomyces cerevisiae, Kluyveromyces 
lactis (also known as Kluyveromyces marxianus var. lactis), Hansenula polymorpha, 
Pichia pastoris, Yarrowia lipolytica, and Schizosaccharomyces pombe. 
20 Most preferred are, however, (e.g. filamentous) fungal host cells. Preferred 

. filamentous fungal host cells are selected from the group consisting of the genera 
Aspergillus, Trichoderma, Fusarium, Disporotrichum, Penicillium, Acremonium, 
Neurospora 9 Thermoascus, Myceliophtora 9 Sporotrichum, Thielavia, and 
Talaromyces. More preferably a filamentous fungal host cell is of the species 
25 Aspergillus oyzae 9 Aspergillus sojae, Aspergillus nidulans, or a species from the 
Aspergillus niger Group. 23 These include but are not limited to Aspergillus niger, 
Aspergillus awamori, Aspergillus tubingensis, Aspergillus aculeatus, Aspergillus 
foetidus, Aspergillus nidulans, Aspergillus japonicus, Aspergillus oryzae and 
Aspergillus ficuum, and further consisting of the species Trichoderma reesei, 
30 Fusarium graminearum, Penicillium chrysogenum, Acremonium alabamense, 
Neurospora crassa, Myceliophtora thermophilum, Sporotrichum cellulophilum, 
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Disporotrichum dimorphosporum and Thielavia terrestris. 

Examples of expression hosts within the scope of the present invention are fungi 
such as Aspergillus species 31,32 and Trichoderma species; bacteria such as Bacillus 
. species 33, 34 , e.g. Bacillus subtilis, Bacillus licheniformis, Bacillus 
5 amyloliquefaciens, Pseudomonas species; and yeasts such as Kluyveromyces species 
35 e.g. Kluyveromyces lactis? 6 and Saccharomyces species, e.g. Saccharomyces 
cerevisiae. 

Culture of host cells and recombinant production: [cob A or cbiAJ 

10 

The invention also includes cells that have been modified to express the 
polypeptides of the invention. Preferably the host will have at least two, or multiple, 
copies of the polynucleotide. Such cells include transient, or preferably stable higher 
eukaryotic cell lines, such as mammalian cells or insect cells, lower eukaryotic cells, 
15 such as yeast and (e.g. filamentous) fungal cells or prokaryotic cells such as bacterial 
cells (e.g of the order Actinomycetales). 

It is also possible for the proteins of the invention to be transiently expressed in 
a cell line or on a membrane, such as for example in a baculovirus expression system. 

According to the present invention, the production of the polypeptide of the 
20 invention can be effected by the culturing of microbial expression hosts, which have . 
been transformed with one or more polynucleotides of the present invention, in a 
conventional nutrient fermentation medium; 

The recombinant host cells according to the invention may be cultured using 
procedures known in the art. For each combination of a promoter and a host cell, 
25 culture condition are available which are conducive to the expression the DNA 

sequence encoding the polypeptide and/or vitamin B t2 production. After reaching the 
desired cell density or titre the culture can be stopped and the polypeptide, or 
vitamin, is recovered using known procedures. 

The fermentation medium can comprise a carbon source (e.g. glucose, maltose, 
30 molasses, cellulose, J3-glucan etc.) and an (inorganic) nitrogen source (e.g. 

ammonium sulphate, ammonium nitrate, ammonium chloride, etc.) and/or an 
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(organic) nitrogen source (e.g. yeast extract, malt extract, peptone, etc.). An inorganic 
nutrient source (e.g. phosphate, magnesium, potassium, zinc, iron, etc.) and/or an 
inducer (e.g. cellulose, J3-glucan, maltose or maltodextrin) may be present. 
The selection of the appropriate medium may be based on the choice of 
5 expression host and/or based on the regulatory requirements of the expression 
construct. Such media are known to those skilled in the art. The medium may, if 
desired, contain additional components favouring the transformed expression hosts 
over other potentially contaminating microorganisms. 

The fermentation can be performed over a period of 0.5-30 days. It may be a 

10 batch, continuous or fed-batch process, suitably at a temperature in the range of 
between 0 and 45°C and, for example, at a pH between 2 and 10. Preferred 
fermentation conditions are a temperature in the range of between 20 and 37°C 
and/or a pH between 3 and 9. The appropriate conditions are usually selected based 
on the choice of the expression host and the protein to be expressed. 

15 After fermentation, if necessary, the cells can be removed from the fermentation 

broth by means of centrifugation or filtration. After fermentation has stopped or after 
removal of the cells, the polypeptide of the invention may then be recovered and, if 
desired, purified and isolated by conventional means. 

20 D. Uses of the polypeptides in the biosvnthetic pathway and to produce vitamin Bn 
(reaction/enzyme") 

A Amidation ((amide) synthase) 

25 An additional aspect of the invention relates to a process of amidation, or for the 

preparation of an amine, the process comprising contacting a substrate with a 
polypeptide of the invention. The process therefore includes amidating a substrate. 
The polypeptide is preferably a synthase, such as an amide synthase. It may be a 
polypeptide having the sequence of SEQ ID No. 2, or a variant or fragment thereof, 

30 as defined earlier, such as in the first aspect. Alternatively, the polypeptide may be a 
synthase from a bacterium of the family Mycobacteriaceae, for example of the genus 
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Propionibacterium, in particular the species Propionibacteriumfreudenreichii. 

The process may be conducted in the presence of glutamine. The glutamine may 

be converted to glutamate in the reaction. The polypeptide maybe capable of 

converting a hydroxyl group into an amine, or a carboxyl group (COOH) into a . 
5 carboxyamide group (CONH 2 ). The product resulting from the process may 

therefore be a primary amine. 

The process may be repeated, since the polypeptide may amidate the substrate 

twice, in other words create two (preferably primary) amine groups on (different) 

substituents of the substrate. Therefore, the process may involve converting a first 
10 carboxyl group to a carboxyamide group. The process may be repeated, and a second 

carboxyl group may be converted to a carboxyamide group as well. In this way, this 

process may involve amidation twice, for example the creation of two separate (e.g. 

primary) amines. The (second) amidation preferably takes place at a different 

substituent (e.g. carboxyl group) on the substrate. 
15 Preferably, the substrate is cobyrinic acid or cobyrinic acid c-amide and/or the 

, product is cobyrinic acid c-amide or cobyrinic-a,c-diamide. In this reaction, 

glutamine can be converted to glutamate. The amount of glutamine added or present 

may be approximately twice as much as the cobyrinic acid (in other words, glutamine 

is at a molar concentration of about double the molar concentration of cobyrinic 
20 acid). This is because the cobyrinic acid is first amidated to cobyrinic acid c-amide, 

which acts as an intermediate, and the cobyrinic c-amide is then amidated in a second 

amidation reaction to give cobyrinic a,c-diamide. 

Hence, the polypeptide in this process is preferably a cobyrinic acid a,c-diamide 

synthase (e.g. cobA or cbiA). The polypeptide may have an activity within EC 
25 6.3.1.-. 

Bl (Phosphotransferase (phosphorylation): fcobUJ 

The invention also relates to a process of phosharylation, or for the preparation 
30 of a phosphate-containing compound, the process comprising contacting a substrate 
with a polypeptide of the invention. The polypeptide preferably comprises SEQ ID 
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No. 4, or a variant or fragment thereof, as defined earlier. Alternatively, the 
polypeptide may be a phosphotransferase from a bacterium of the family 
Mycobacteriaceae, for example of the genus Propionibacterium, in particular the 
species Propionibacterium freudenreichiu This process thus comprises 
5 phosphorylating (or attaching a phosphate group to) a substrate. 

The process may be conducted in the presence of a nucleotidyl (e.g. tri) 
phosphate, such as ATP. The substrate may comprise a nucleotide, such as one 
including adenosine. 

Preferably the process comprises transfer (from one compound to another, such 
10 as to the substrate) of a phosphate moiety, such as phosphorylation of a hydroxyl 
(OH) group to form a phosphate group (-PCT 4 ). The polypeptide may thus act as a 
phosphotransferase with an alcohol group (such as a hydroxy group) as an acceptor. 

Preferably the substrate is adehosyl cobinamide (Formula IT) and/or the product 
is adenosyl cobinamide phosphate (Formula HA). The process may additionally 
15 comprise the conversion of a nucleotide triphosphate to a nucleotide diphosphate (for 
example, ATP to ADP). The polypeptide may therefore be a cobinamide kinase (e.g. 
cobU). Preferably, it has an activity within EC 2.7. 1 .-. 

B2 (Nucleotidyl) transferase (nucleotidylation) 

20 

In this aspect the invention relates to a process of nucleotidylation, or for the 
preparation of a nucleotidyl-containing compound, the process comprising contacting 
a substrate with a polypeptide that is a nucleotide transferase. This polypeptide may 
be the same as that described in Bl above. This is because the (second) enzyme 
25 (designated B) has a dual function, and is a bifunctional enzyme. 

Thus enzyme B can act as a general transferase, transferring both phosphate 
groups as well as nucleotidyl groups. It can therefore act as both a 
phosphotransferase (Bl) and as a nucleotidyl transferase (B2). 

The second function, designated B2, relates to the activity of the enzyme as a 
30 nucleotidyl-containing transferase. 

Thus, preferably the process involved nucleotidylating a substrate, such as 
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guanidylating (a substrate). Preferably, the substrate will comprise at least one 
phosphate group. Suitably, the polypeptide is able to nucleotidylate a phosphate 
group. 

The process may take place in the presence of a nucleosyl (e.g. tri) phosphate, 
5 for example GTP). Thus in this process the polypeptide preferably catalyses the 
guanidylation of a phosphate group. 

Preferably the substrate is adenosyl cobinamide phosphate (Formula HA) and/or 
the product is adenosyl-GDP-cobinamide (Formula IIB). Thus the enzyme can 
catalyse the formation of a nucleotidyl-containing compound, suach as adenosyl- 
10 GDP-cobinamide, from a substrate such as adenosyl cobinamide phosphate. The 
' polypeptide may thus be a (nucleotidyl) transferase, or has an activity within EC 
2.7.7.-. Other preferred features of the polypeptide are as described in the previous 
section concerning phosphotransferase activity (Bl). 

15 C Arylation (aryl transferase) or ribazole addition: fcobSJ 

This process of the invention comprises arylation, or the preparation of an aryl- 
containing compound, the process comprising contacting a substrate with a 
polypeptide of the invention, preferably an (e.g. aryl) transferase. Preferably the 
20 polypeptide comprises SEQ ID No. 6, or a variant or fragment thereof as previously 
defined. Alternatively, the polypeptide may be an aryltransferase from a bacterium of 
the family Mycobacteriaceae, for example of the genus Propionibacterium, in 
particular the species Propionibacterium freudenreichiL This process thus comprises 
arylating a substrate. 

25 The aryl moiety (e.g. in ribazole, such as is transferred during a reaction) may 

comprises an aromatic ring system. The aryl moiety may comprise one or two 
aromatic rings. The ring system may be substituted by from one to four Ci-s alkyl 
groups. The aryl moiety may comprise none, one or two heteroatoms, for example 
one or two nitrogen atoms. Preferably the aryl moiety comprises a benzimidazole 

30 ring. The process therefore may comprise the preparation of a . (e.g. dimethyl) 
benzimidazole (DMB)-containing compound. 
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The aryl group maybe bonded or joined to a (central) metal, e.g. cobalt, atom. 
Alternatively or in addition, the aryl group may be bonded to a carbon atom, such as 
in a ribose group. Preferably the aryl moiety is bonded to both cobalt atom and a 
ribose group (in the product of the reaction, namely the resulting benzimidazole- 
5 containing compound). 

The process may take place in the presence of a ribazole, such as oc-ribazole. 
This may be present in an approximately equimolar amount to the substrate. The 
reaction may comprise alpha-ribazole addition (to the substrate). 

Preferably the substrate is adenosyl-GDP-cobamide. The product of the 
10 reaction, the aryl-containing compound, is preferably adenosyl-5,6-dimethyl 

benzimidazolyl cobamide (Formula IIC). In the process of the reaction, the ribazole 
may be converted to GMP. 

Preferably the polypeptide used in this process is a cobalamin (5 5 -phosphate) 
synthase (e.g. cobS). The polypeptide may have the activity EC 2.7.8.-. 

15 

D Adenosylation (adenosyl transferase) 

This process of the invention relates to* adenosylation, or the preparation of an 
adenosine-containing compound, the process comprising contacting a substrate with 
20 a polypeptide of the invention, preferably a transferase, such as an adenosyl 

transferase. The polypeptide preferably comprises SEQ ID No. 7, of fragment or 
variant thereof, as previously defined. Alternatively, the polypeptide maybe a 
transferase from a bacterium of the family Mycobacteriaceae, for example of the 
genus Propionibacteriwn, in particular the species Propionibacterium freudenreichii. 
25 This process thus comprises adenosylating a substrate. 

This process may therefore comprise the transfer of adenosine, preferably to the 
substrate. Preferably the adenosine becomes bound to a metal atom, such as a 
transition metal (such as of the first series), for example cobalt. 

The substrate (and/or product) maybe an amide, such as a diamide. Preferably 
30 the substrate is cobyrinic acid a, c diamide and/or the product is adenosyl cobyrinic 
acid a, c diamide. 
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The process may take place in the presence of a nucleosyl (e.g. tri) phosphate, 
such as ATP. It may also take place in the presence of adenosine. Preferably, both 
the adenosine and the nucleosyl phosphate are present in approximately equimolar 
amounts to the substrate. The nucleosyl triphosphate may be converted to a 
5 nucleosyl diphosphate. 

Preferably the polypeptide in this process is an adenosyl transferase. It may 
have an activity within EC 2.5.1.7. Preferably, the polypeptide is a transferase that is 
capable of transferring alkyl or aryl groups, other than methyl groups. Polypeptides 
that methylate, or cause methylation, may thus be excluded. 

10 

Substrates (or products of catalysed reaction): vitamin Bn intermediates and/or 
precursors 

The substrate and/or product preferably comprises a corrin core or ring 

15 system. Preferably, it comprises an aryl ring system, with up to four rings (which 
may be the same or different). Preferably however there are four rings, and they are 
the same. Each ring may contain one or two heteroatoms, for example one nitrogen 
atom. The ring maybe pentagonal. Thus preferred rings are pyrrole, and therefore 
the ring system preferably comprises a tetrapyixole system. Preferably two of the 

20 pyrrole rings are joined to each other, and the other two pyrrole rings are joined by a 
bridge, such as by methene units. 

The ring system may comprise a metal atom, for example at its core. This 
metal may be a transition metal, for example of the first series (group Vm). It may 
be of period four. Preferably the metal is cobalt, and this may be a single central 

25 cobalt atom. The ring system may have attached to it an amide, phosphate, guanidyl 
or adenosyl moiety or group. In addition to the metal atom and ring system, there 
may be a fifth, and optionally even a sixth substituent, for example bound to the 
metal. The or each substitutuent may be above and/or below the plane of the ring, as 
applicable. Where appropriate (and in the case of vitamin Bi 2 ), one of these 

30 substituents may comprise a nucleosyl group, such as a dioxy nucleosyl, preferably 
5*-dioxy adenosine. Another substituent may be an aryl group as defined above in 
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section C concerning arylation. This substituent may therefore preferably comprise 
dimethoxybenzimidazole. In substrates employed in the invention, one or both 
substituents may be present, and so that the cobalt atom may have, as fifth and sixth 
substituents, a 5'-dioxy adenosine and a dimethoxy benzimidazole group. The same 
5 preferred features apply to the product of the reaction as they do to the substrate. 
A list of the substrates and products of the five catalysed reactions, or 
biosynthentic steps, is provided below after the role of the enzymes in* the vitamin 
B 12 biosynthetic pathway. . 

10 A(cobA) . B(cobU) C (cobS) 

Uro HI— > -> -» Cobinamide — >cobinamide-P^ Vitamin B12 



15 



20 



Formula 
Enzyme 

I 

IA 
IB 
IC 



25 



Common name 

cobyrinic acid 

cobyrinic acid 
c-amide 

cobyrinic acid 
a,c-diamide 

4 

adenosyl 

cobyrinic acid-a,c-diamide 



Reaction / Biosynthetic step: 



amide synthase 



D 



amide synthase 
(adenosyl) transferase 



30 



35 



(cobU) 
HA 



(cobU) 

iib 



adenosyl 
cobinamide 

4 

adenosyl 

cobinamide phosphate 

4 

adenosyl-GDP 



B 1 (phospho) transferase 



B2 • (nucleotidyltransferase 
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cobamide 



transferase (cobS) 
EC 



(benzimidazolyl) 



adenosyl-5,6 

dimethyl benzimidazolyl 

cobamide 

(vitamin B 12) 



Preferred intermediates or precursors include cobinamide, cobinamide-P 
(phosphate) or 

compounds of Formula 1A, IB, II, DA, IIB or IIC. 

Hence the process of the present invention may comprise one or more of the 

following process steps (illustrated using chemical formulae), namely: 



Formula I 



C0 2 H 



15 




C0 2 H 



CQ 2 H 



I (amide) synthase (A) 

(SEQ ID No. 2, or variants, etc thereof) 



20 
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I (amide) synthase (A) 

(SEQ ID No. 2, or variants, etc thereof) 



Formula IB 




WO 2004/011635 PCT/EP2003/008216 

40 

4 adenosyl transferase (D) 

(SEQ ID No; 8, or variants, etc thereof) 

Formula IC 

C0 2 H 




5 

Formula II 




CONH 2 



OH 
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I (phospho) transferase (Bl) 
(SEQ ID No. 4, or variants, etc. thereof) 



Formula HA 




CONH 2 



I (nucleotidyl) transferase (B2) 

(SEQ ID No. 4, or variants, etc. thereof) 



PCT/EP2003/008216 




^ (benzimidazolyl) transferase (C) 
(SEQ ID No. 6, or variants, etc. thereof) 
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Formula IIC ( includes vitamin B 12 ) 




CONH 2 



OH 



E. Industrial preparation of vitamin Bi? 

5 

As described- above, the polypeptides. of the.invention may be used to perform 
one or more biosynthetic steps in the vitamin B\ 2 preparatory pathway. The 
polypeptide may be contacted with the appropriate substrate and the reaction allowed 
to take place. This may be allowed to occur when the polypeptide is outside a (e.g. 
10 host) cell, in other words the polypeptide is mixed with the substrate, for example in 
vitro. However, it is considerably more effective to use a host cell of the invention, 
comprising one or more of the polypeptides of the invention, in order to either 
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perform one or moreparticularly desired steps in the vitamin B i2 biosynthetic 
pathway or, preferably, use the host cell to produce vitamin Bi 2 (or an intermediate or 
precursor thereof). 

5 The invention therefore also relates to a process for preparing vitamin B12 (or a 

precursor thereof), the process comprising culturing one or more host cells of the 
invention under conditions that allow the cells to biosynthesise (and hence produce) 
vitamin B12 (or the precursor). 

This process may take place in a fermenter. The fermenter may be equipped 

10 with agitation means, for example a stirrer. The fermenter vessel may also be 

equipped with aeration means, such as a means of causing an oxygen-containing gas 
to be contacted with a liquid in the fermenter. The liquid will usually be the culture 
broth, consisting of an aqueous suspension of the cells. Fermentation may then be 
allowed to take place. The fermenter may have a minimum volume of 10, 50, 100 or 

15 1,000 litres. 

The cells may be supplied with one or more carbon and/or nitrogen sources, 
before fermentation begins, at the start of fermentation, or continuously or 
continually during fermentation. 

At the end of the fermentation, the supply of the carbon or nitrogen sources is 

20 stopped, or one or more of these sources may be used up. Each of the carbon and/or 
nitrogen sources may be a complex source or an organic or inorganic compound. 

The cells may then be removed from the fermenter. Before or after this, water 
(or an aqueous liquid) may be removed from the cells. The cells may then be heated 
or pasteurised, in order to kill them. 

25 Methods of extracting or isolating vitamin Bi 2 from microbial cells are well 

known 45 . For example (in order to be able to obtain the vitamin B12) it is preferred 
that the host cells, having produced the vitamin, are broken, or at least partially 
opened, so that at least part of the soluble content of the cells (comprising vitamin 
B12) is released into a liquid, for example a liquid in which the cells are contained. 

30 One can then separate the open or broken cells, or resulting cell debris, from the 
liquid (comprising the vitamin B12). The microbial cells (containing vitamin B12) 
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can thus be treated so as to cause disruption of the cell membranes. Suitable 
treatments for opening the cells include heat treatment, such as pasteurisation, 
heating in an autoclave, treatment with bacteriolytic enzymes (such as lysozyme), " 
and/or mechanical disruption cells (grinding, or the use of shear forces), or treatment 
5 of chemicals (to cause cell lysis, for example the use of detergents or organic 
• solvents). 

The process of lysis or other membrane disruption can produce a lysate, 

which can then be separated into solid and liquid phases. The solid phase of the 

lysate, comprising the cell debris, can then be separated (from the liquid containing 
10 vitamin B12). A number of suitable solid-liquid separation techniques are available, 

including centrifugation and/or filtration. Preferably however, solid liquid separation 

is performed using ultrafiltration. 

Preferably, the open/broken microbial cells are washed, and the washings are 

then combined or added to the (vitamin B12 containing) liquid, separated from cell 
15 debris. Suitably washing comprises diafiltration,suitably with ion-free water. The 

(vitamin B 12) containing diafiltrate can then be combined with the (vitamin B12 

containing) liquid phase. 

The vitamin B12 containing liquid(s) rriiay then be subjected to drying, for 

example spray-drying, fluid-bed drying, freeze drying or drying in a vacuum. 
20 Preferably, cells producing vitamin B12 are washed prior to opening (lysis), 

since this may increase the vitamin B12 concentration on dry matter by removing 

medium components. This can be performed using diafiltration, preferably using 

ion-free water. 

Preferred features and characteristics of one aspect of the invention are 
25 applicable to another aspect mutatis mutandis. 

The invention will now be described with reference to the following Examples 
which are intended to be illustrative only and not limiting. 

EXAMPLE 1 

30 

Propionibacterium vectors 
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Two strains (P. Jreudenreichii LMG16545 and P. Jreudenreichii LMG16546) 
were used which both show an identical plasmid profile of 2 plasmids. One plasmid 
was large (size not determined) and the other was smaller, more abundantly present 
and had a size of 3.6 kb. These 3.6 kb plasmids from LMG16545 and LMG16546 
5 were chosen for further use in vectors. Plasmids from these vectors already are 
described. 29 Expression systems in Pfopionibacteria are known in the art 30 . 

Construction of E. coli/Promonibacterium shuttle vectors 

A 1.7 kb Acc65l fragment from the SaccharopolyspoYa erythraea NRRL2338 

10 erythromycin biosynthesis cluster and containing the erythromycin resistance 

conferring gene 37,38 was inserted into Acc65l linearized pBR322AI 29 . Then the newly 
derived construct, named pBRES, was linearized with EcoRV and ligated to p545 
DNA that had been digested with BsaBL E.coli transformants were found to harbor a 
vector with the correct insert, in both orientations. The resulting plasmid vectors were 

15 named pBRESP36Bl and pBRESP36B2 (see Figs. 2a and 2b 29 ). 

t Plasmid vector constructs were also obtained with p545 DNA linearized in an 
other restriction site situated outside the putative replication region, namely AlwNl. 
For this construction the pBRES vector had to be provided with a suitable cloning 
site. An adaptor was designed and annealing of the required oligo's created a double 

20 stranded DNA fragment with Acc65I and BglTL cohesive ends respectively, which 
moreover contains an internal SJil restriction site, that provides ends compatible to 
the AlwNl digested p545 plasmid. This adaptor was cloned in pBRES between the 
BgttL and the proximal Acc65I site. The pBRES-Sfi vector thus obtained was 
subsequently digested by SJil and ligated to AlwNl digested p545. Transformation of 

25 E.coli yielded transformants with the correct vector as confirmed by restriction 
enzyme analysis. The vector obtained was named pBRESP36A 29 . 

Transformation of Propionibacterium with E. coli/ Propionibacterium shuttle vectors 
Transformation of Propionibacterium Jreudenreichii strain ATCC6207 with 
30 pBRESP36Bl will be described. 

The bacterial cells are cultivated in SLB (sodium lactate broth 39 at 30°C to a 
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stationary growth phase, and subsequently diluted 1:5.0 in fresh SLB. After 

incubation at 30°C for around 20 hours, cells (now in the exponential growth phase) 

were harvested and washed extensively in cold 0.5M sucrose. Subsequently cells 

were, washed once in the electroporation buffer, consisting of 0.5M sucrose, buffered 
5 by ImM potassium acetate, pH5.5, and finally resuspended in this electroporation 

buffer in about 1/100 of the original culture volume. Cells were kept on ice during 

the whole procedure. 

For the electroporation (apparatus from BIORAD), 80- 100 jol of cell 

suspension was mixed with ± 1 jag of DNA (or smaller amounts), in a cooled 1 or 2 
10 . mm electroporation cuvette, and an electric pulse delivered. Optimal pulse conditions 

were found to be 25kV/cm at 200 Q resistance and 25jjP capacitance. However, 

lower and higher voltages (also at 100H) also yield transformants. 

Immediately after the pulse, 900 jil cold SLB containing 0.5 M sucrose was 

added to the pulsed cell suspension and these are subsequently incubated for 2.5 to 3 
15 hours at 30°C before plating appropriate dilutions on SLB/agar plates containing 0.5 

M sucrose and lOjxg/ml erythromycin. After a 5 to 7 day incubation period at 30°C 

under anaerobic conditions, transformants were detected. 

DNA isolated from E. coli DH5a (Promega) yielded a transformation efficiency 

of 20 - 30 transformants per jag DNA. A 10-100 fold higher efficiency is achieved 
20 when DNA is isolated from E. coli JM1 10 (dam", dcm" strain). E. coli transformation 

was done according to BIORAD's instructions. 

Transformants contained the authentic vectors, indistinguishable from the 

original plasmid DNA used for transformation of ATCC6207. This was shown by 

restriction enzyme analysis of plasmid DNA isolated from the transformants by the 
25 small scale plasmid DNA isolation procedure refered to before. 

Vectors were exclusively present as autonomously replicating plasmids. 

Southern blot hybridization 40 with total DNA isolates showed that chromosomal 

DNA did not hybridise to the vector DNA used as a probe, indicating that no 

chromosomal integration of plasmid DNA occured. 
30 Transformation was also successful with vectors pBRESP36B2 and 

pBRESP36A, indicating that functionality of the vector was independent of the 
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orientation of p545 or the cloning site used. Also in this case the authenticity of the 
vectors was confirmed. 

Moreover, transformation of P. freudenreichii strain ATCC6207 with DNA 
isolated from a Propionibacterium transfonnant resulted in a 10 5 -10 6 fold increased 
5 transformation efficiency as compared to that obtained with DNA isolated from E. 
coli DH5a. 

Transformation of another P. freudenreichii strain, LMG16545 (the same strain 
from which the p545 plasmid was obtained), resulted in a transformation efficiency 
comparable to that of the ATCC strain. 
10 The transformations we repeated using each of SEQ ID Nos. 1,3,5 and 7 

operably linked to appropriate transcription and translation start signals in the shuttle 
vector. 

EXAMPLE 2 

15 

Construction of plasmid vector containing the amide synthase (A) sene 

The construction and application of a plasmid vector to increase the level of 
vitamin B12 (cobalamine) synthesis in P. freudenreichii strain ATCC6207 will be 
described: 

20 For construction of gene overexpression plasmids use was made of the 16S 

rRNA promoter from P. freudenreichii. One of the strategies to test promoter bearing 
fragments is the use of a promoter-probe vector. The reporter gene used to monitor 
promoter activity encodes ah easily detectable enymatic activity that is not present in 
the wild type strain. The cat (chloramphenicol acetyl transferase) gene from 

25 pACYCl 84 17 ' 42 was used for construction of a promoter-probe vector. In order to 

analyse the activity of the 16S rRNA promoter this promoter was placed upstream of 
the cat gene. 

For construction of the promoter-probe vector, the promoterless cat gene was 
cloned in the 

30 E. coli /Propionibacterium shuttle vector pBRESP36B2 29 by PCR, resulting in 
vector pB2/PoCAT. The upstream PCR primer included the sequence 5- 
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GGGATCCTCTAGAGCATGCAAGCTTCTCGAGAATCGATAGATCTCTAAGG 
AAGCTAAAATG-3 1 , in which the last three nucleotides indicate the start codon of 
the cat gene. This synthetically derived sequence includes a multi-cloning site (MCS) 
containing the restriction sites BamHI, Xbal, HindlH, SphI, Xhol, Clal and BglH The 
5 downstream PCR primer included a'BamHI restriction site. After PCR amplification 
the cat gene was cloned as a BamHI fragment in the Bgin site of the vector (BamHI 
and BgUI sites not restored). Two orientations of the cat gene were obtained. The 
orientation in which the cat gene has the same orientation as the beta lactamase gene 
in the pBR322 segment was used in further experiments. 

10 On the basis of the sequence of 1 6S rRNA 'from Propionibacterium 

freudenreichii ATCC6207 (GenBank accession number X53217) an appropriate 
restriction enzyme was chosen (HindE) and appropriate PCR primers were designed 
that enabled the amplification of an approximately 3 kb region encompassing the 
promoter by inverse PCR 2 . From the PCR fragment a 0.6 kb Sphl-HmdHI fragment 

15 directly upstream of the 16S rRNA coding sequence was isolated. This fragment was 
ligated in pB2/PoCAT digested with the same enzymes resulting in a plasmid named 
pB2/PrRNACAT. After transformation of E. coli chloramphenicol resistant 
transformants were obtained. After transformation P. freudenreichii strain ATCC 
6207 colonies were only obtained on erythromycin containing plates, not on 

20 chloramphenicol containing plates. However, when streaked on chloramphenicol 
containing plates transformants containing pB2/PrRNACAT grew whereas 
ATCC6207 cells containing pB2/PoCAT did not grow, thus indicating the 
functionality of the 16S rRNA in P. freudenreichii 

An expression vector containing the 16S rRNA promoter but lacking the cat 

25 gene was constructed by ligation of the approx. 700 bp BamHI - Bglll fragment from 
vector 

pB2/PrRNACAT, containing the 16S ribosomal KNA promoter from P. 
freudenreichii, into the unique Bgin of pBRESP36B2. Both possible orientations of 
the promoter element in the vector were obtained. In case transcription of genes 
30 expressed by this ribosomal promoter was not properly terminated, readthrough may 
obstruct transcription of the two replication genes of the Propionibacterium replicon 
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if the ribosomal promoter is oriented towards these replication genes. Therefore the 
vector in which the promoter was cloned in the opposite orientation, 
pBRESP36B2pl6SH, was used in further experiments. A unique Bgin site 
downstream from the promoter was used for cloning of the expression library. 

5 

EXAMPLE 3 

The complete coding sequence of adenosyl transferase (D, SEQ ID No. 7) was 
generated by PCR from appropriate primers. The upstream PCR primer included a 5' 
10 extension including a Bgin restriction site and a ribosome binding site upstream of 
the start codon of the gene. The downstream PCR primer included a 5' extension 
including a BglH restriction site. After digestion of the amplified fragment with Bgin 
the fragment was ligated in the vector pBRESP36B2pl6SH and digested with Bgin 
and dephosphorylated to remove 5' phosphate groups. After transformation of E. coli 
15 ampicillin resistant colonies were obtained. Both orientations of the cloned fragment 
releative to the vector were observed. However, only the construct in which the 16S 
rRNA promoter is located directly upstream of the ribosome binding site allows the 
expression of the adenosyl transferase gene. 

The latter construct was transformed to P. freudenreichii ATCC6207 as described 
20 before. In the transformants the level of vitamin Bi 2 synthesis was determined as 
follows. Frozen cultures of Propionibacterium transformants, as well as a control 
strain containing only the vector plasmid pBRESP36B2, were inoculated in 100 ml 
flasks containing 50 ml of BHI (Brain Heart Infusion) medium (Difco) and incubated 
for 72 hrs at 28°C without shaking. From this preculture 4 ml were transferred to 200 
25 ml of production medium consisting of Difco yeast extract 15 g/1, Na lactate 30 g/1, 
KH2PO4 0.5g/l, M11SO4 0.01 g/1, and CoCl 2 0.005 g/1 in a 500 ml shake flask and 
incubated at 28°C for 56 hrs without shaking, followed by 48 hrs in a New 
Brunswick rotary shaker at 200 rpm. 

Vitamin B I2 titres were measured using HPLC 43 and showed a higher vitamin Bi 2 
.30 production than the control strain. 

The process was repeated for each of the other three genes namely: 
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A: SEQ ID No. 1, cobyrinic acid -a, c-diamide. synthase; 
B: SEQ ID No. 3, cobinamide kinase gene; and 
C: SEQ ID No. 5, cobalamin (5' -phosphate) synthase. 
The four genes (A, B, C, and D) were then combined in one operon to further - 
5 increase vitamin 

B12 production. 

The resulting transformed cells (the P . freudenreichii) ATCC 6207) was 
cultured in a fennenter using a known technique 44 : hi order to kill the cells, and to 
cause lysis, the broth was pasteurised at 65°C for thirty minutes. The broth was then 
10 subjected to ultrafiltration, and a pink coloured filtrate was obtained, containing 
vitamin Bi 2 . The heat had caused lysis of the cells, and therefore release of the 
intracellular vitamin Bi 2 in the medium. 

15 EXAMPLE 4 

Construction of expression vectors for Propionibacteria and its use in expression of 
enzymes of the invention and production of vitamin Bp in Provionibacterium 
freudenreichii 

20 Apart from the expression systems described above 29, 30 , two other expression 

systems are known in the art that can be used to express, in multiple copies if desired, 
genes encoding the novel enzymes of the invention. The first is pRGOl, a plasmid 
from Proprionibacterium acidipropionici 27 . This was used to create a shuttle vector 
pPK705. Such a vector was used to cany successfully the cobyrinic acid a,c-diamide 

25 synthase (A) enzyme, and thus transform Proprionibacterium freudenreichii - 
subspecies shermanii. 

The other suitable expression system that was employed used the known shuttle 
vector pPK705 , which is able to shuttle between E. coli and Proprionibacterium. 
This allowed the construction of expression vectors for Proprionibacteria, and the 

30 incorporation of enzyme B, cobinamide kinase, into Freudenreichii subspecies 

shermanii. This was achieved by using the cobinamide kinase gene, SEQ ED No. 3 
of the invention. 
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COMPARATIVE EXAMPLE 5 

Shake flask fermentation of Propionibacterium containing an empty plasmid vector 
5 The transformation of P. freudenreichii strain ATCC6207 with the E. coli I 

Propionibacterium shuttle vector pBRESP36B2pl6SH has been described in 
Example 2. Transformants were selected on erythromycin containing agar plates as 
described before and transformants containing the authentic plasmid were selected by 
restriction analysis. 

10 Transformants were grown in brown-glass penicillin flasks containing 50 ml 

of anaerobic Ml culture medium prepared as follows. 900 grams of Ml culture 
medium was prepared by adding 90 ml of part B to 8 1 0 grams of part A under 
anaerobic conditions. Erythromycin was added to a final concentration of 10 
milligram / liter. 

15 810 grams of part A consisted of 52.2 g MES, 12.51 g Difco Yeast Extract 

and 0.68 g methionine dissolved in water. The pH was adjusted to 7 with ammonium 
hydroxide and the oxygen was removed by degassing and flushing with nitrogen. 
Finally part A was sterilized in an autoclave. 

90 ml of part B consisted of 1 .5 ml trace element solution, 1 .5 ml filter 
20 sterilized CoC12.6H20 (5g/kg), 0.9 ml filter sterilized Ca-pantothenate (20g/kg) and 
86.1 ml degassed sterile glucose (440 g/kg). 

1 kg of trace elements solution contained lOg citric acid, 10 ml H2S04, 1.2 g 
MnS04.H20, 300 g MgS04.7H20, 6 g FeS04.7H20 and 2.4 g ZnS04.7H20 in 
water. The solution was filter sterilized. 
25 The 50 ml Ml medium was inoculated under anaerobic conditions with 5 ml 

_ of an pre-culture that consisted of a selected pBRESP36B2pl6SH transformant that 
had been grown anaerobically in Ml culture medium for 3 days at 30 °C with gentle 
shaking until an OD630 of approximately 10. 

The 50 ml culture was incubated for 40 hours at 30°C without shaking. At this 
30 time-point a sample was taken from the culture and the OD630 determined (=OD- 
ana). The pH of the medium was corrected through the anaerobic addition of NaOH 
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(10%). The number of microliters of NaOH (10%) needed were calculated using the 
formula: number of microlitres NaOH (10%) = (19.67 x OD-ana) - 60. As a next 
step 0.5 ml of DBI (0.5 g/kg) was anaerobically added. Following these additions the 
culture was incubated for another 24 hrs at 30°C with shaking at 200 rpm. The 
5 OD630 was determined again (=OD-end). 

1 0 ml samples of the cultures were collected. The cells were harvested by . 
centrifugation and the pellet was washed once with 1 volume sterile demineralised 
water. The pellet was stored at -20°C. Thawed samples were heated to 80°C in a 2% 
KCN containing buffer to open the cells and to stabilise vitamin B12 and precursors. 

10 Vitamin B12 titers as well as adenosyl cobinamide and adenosyl cobinamide 

phosphate precursor levels were measured using HPLC using appropriate standards 
(43). The identity of the adenosyl cobinamide and adenosyl cobinamide phosphate 
precursor peaks was established on the basis of mass spectrometry data. 

The amount of vitamin B12 produced per ml culture of pBRESP36B2pi6SH 

15 transformants was divided by OD-end. The resulting ratio obtained for 

pBRESP36B2pl6SH transformants was used as a reference for the ratios obtained 
for P. frendenreichii transformants containing other plasmids (e.g. next example). 
Similar ratios were calculated for the . amounts of adenosyl cobinamide and adenosyl 
cobinamide phosphate produced (amount per ml culture per OD-end). These ratios 

20 were used as reference for the ratios obtained for respectively adenosyl cobinamide 
and adenosyl cobinamide phosphate in P. freadenreichii transformants containing 
other plasmids. 

EXAMPLE 6 

25 

Construction of an expression vector for the Provionibacterium freudenreichii 
cobA/cbiA sene and its use for the production of vitamin B12 and its precursors in 
Propionibacterium freudenreichii 

30 A Propionibacterium freudenreichii gene coding for glutamate 1- 

semialdehyde aminomutase named hemL 46 (EMBL accession number D 12643) was 
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used. The coding region of this gene is followed by the transcriptional hemL 
terminator. A DNA fragment containing the coding region of the hemL gene and the 
hetnL terminator was cloned in pBRESP36B2pl6SH as described in the following 
steps. The DNA fragment was amplified by PCR from chromosomal DNA isolated 
5 from P. freudenreichii ATCC6207 using the following primers: 5 s -C AgTAgATCT 
CgACAAggAggAACCCAtgAg-3 5 and 5'- 

CgTAAgATCTC AgTTTCggAC ATggCAgTg-3 5 . Both primers contain a Bgffl 
restriction site. The PCR fragment obtained was ligated into the vector pCR-Blunt II- 
TOPO (Invitrogen) to create pGBPHEL-1. After transformation of E. coli kanamycin 

10 " resistant colonies were obtained. The constructs thus obtained were verified by 
restriction analysis and the DNA sequence of the fragment was verified through 
sequence analysis. 

pGBPHEL-1 was subsequently digested with BgUI and the fragment 
containing the hemL gene was isolated and ligated into the vector 

15 pBRESP36B2pl6SH which first had been digested with BgUI and dephosphorylated. 
After transformation of E< coli ampicillin resistant colonies were obtained. A 
construct in which the hemL gene is transcribed from the 16S promoter was selected 
by restriction analysis. The resulting construct was called pGBP01HEL-l . 

A P. freudenreichii gene coding for uroporphyrinogen HI methyltransferase 

20 named cobA (EMBL accession number Ul 3043) was used 47 . In order to over-express 
the cob A gene in Propionibacterium the gene was placed behind the 16S rRNA 
promoter and in front of the hemL terminator present in pGBP01HEL-l as described 
in the following steps. A DNA fragment containing the cobA coding region was 
amplified by PCR from chromosomal DNA isolated from P. freudenreichii 

25 ATCC6207 using the following primers: 5'- CACCACCAACATCgATgAggAAAC- 
3' and 5'- TCCAATTgggACTC AgTggTCgCTg-3 ' . The first primer contains a Clal 
and the second primer contains a Mfel restriction site. The PCR fragment obtained 
was ligated into the vector pCR-Blunt II-TOPO (Invitrogen). After transformation of 
E coli kanamycin resistant colonies were obtained. The DNA sequence of the cloned 

30 fragment was verified through sequence analysis. The resulting construct was named 
pGBPCOB-1. 
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pGBPCOB-1 was subsequently digested with Clal and Mfel and the fragment 
containing the cob A gene was isolated and ligated into the large vector fragment 
isolated from pGBP01HEL-l digested with the same enzymes. After transformation 
of E. coli ampicillin resistant colonies were obtained. Plasmid DNA isolated from 
5 transformants was verified by restriction analysis and the resulting construct was 
called pGBP02COB-l. 

pGBP02COB-l was transformed to E. coli JM101 and plasmid DNA isolated • 
from the obtained transformants was verified by restriction analysis; The plasmid 
DNA was subsequently transformed to P. freudenreichii ATCC6207 as described 
10 before. Transformants were selected on erythromycin containing agar plates as 
described before. The plasmid DNA isolated from the P. freudenreichii 
transformants was verified by restriction analysis. 

Shake flask fermentations of pGBP02COB-l transformants using Ml culture 
medium and the determination of vitamin B 12, adenosyl cobinamide and adenosyl 
15 cobinamide phosphate levels were performed as described in the previous Example. 
Ratios of the amount of vitB12, adenosyl cobinamide and adenosyl 
cobinamide phosphate produced per ml culture OD-end were calculated as described 
in the previous example. The ratio obtained for vitB12 was 6% lower than that 
obtained for pBRESP36B2pl6SH transformants in the previous example. However, 
20 the ratio obtained for adenosyl cobinamide was 227% higher than that obtained for 
pBRESP36B2pl6SH transformants and the ratio obtained for adenosyl cobinamide 
phosphate was 200% higher than that obtained for pBRESP36B2pl6SH * 
transformants. Thus there was a 2-fold increase in cobinamide and 3-fold increase in 
cobinamide P over the empty plasmid (Comparative Example 5). 

25 

EXAMPLE 7 

Construction of an expression vector for the Propionibacterium freudenreichii zene 
encoding vrotein B (cobU) and its use for the production of vitamin B12 and its 
30 precursors in Provionibacterium freudenreichii 
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la order to over-express the gene coding enzyme B (SEQ ID No.3) in 
Propionibacterium the gene was placed behind the 16S rRNA promoter and in front 
of the hemL terminator present in pG!BP01HEL-l as described in the following steps. 
A DNA fragment containing the coding region for B was amplified by PCR from 
5 chromosomal DNA isolated from P. freudenreichii ATCC6207 using the following 
primers: 5 9 -CTgATATCAATTggAggACATCAgATgACCCgCATCgTC-3 9 and 5'- 
CTgAATTCggCC ACgTCAgATCgCgTCC-3 ' . The first primer contains an EcoRV 
and a Mfel restriction site and the second primer contains an EcoRI restriction site. 
The PCR fragment obtained was ligated into the vector pCR-Blunt II-TOPO 

10 (Invitrogen). After transformation of E. coli kanamycin resistant colonies were 

obtained. The DNA sequence of the cloned fragment was verified through sequence 
analysis. The resulting. construct was named pGBPCOU-1. 

pGBPCOU-1 was subsequently digested with EcoRV and EcoRI and the 
fragment containing the gene coding for B was isolated and ligated into the large 

15 vector fragment isolated from pGBP01HEL-l which was first digested with Clal, 
than blunted with Klenow enzyme, and finally digested with Mfel. After 
transformation of E. coli ampicillin resistant colonies were obtained. Plasmid DNA 
isolated from transformants was verified by restriction analysis and the resulting 
construct was called pGBP02COU- 1 . 

20 pGBP02COU-l was transformed to E. coli JM101 and plasmid DNA isolated 

from the obtained transformants was verified by restriction analysis. The plasmid 
DNA was subsequently transformed to P. freudenreichii ATCC6207 as described 
before. Transformants were selected on erythromycin containing agar plates as 
described before. The plasmid DNA isolated from the P. freudenreichii 

25 transformants was verified by restriction analysis. 

Shake flask fermentations of pGBP02COU-l transformants using Ml culture 
medium and the determination of vitamin B 12, adenosyl cobinamide and adenosyl 
cobinamide phosphate levels were performed as described before. 

Ratios of the amount of vitaminB12, adenosyl cobinamide and adenosyl 

30 cobinamide phosphate produced per ml culture OD-end were calculated as described 
before. The ratio obtained for adenosyl cobinamide was 96% lower than that 
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obtained for pBRESP36B2pl6SH transformants and the ratio obtained for adenosyl 
cobinamide phosphate was 700% higher than that obtained for pBRESP36B2pl6SH 
transformants. Thus there was a 7-fold increase in the intermediate cobinamide -P 
when compared with the empty plasmid. 

EXAMPLE 8 

Construction of an expression vector for the Provionibacterium freudenreichii series 
encoding protein B and C (cob U and S) and its use for the production of vitamin 
B12 and its precursors in Provionibacterium freudenreichii 

In order to over-express simultaneously the genes encoding enzymes B (SEQ 
ID No.3) and C (SEQ ID No.5) in Propionibacterium the genes were placed behind 
the 16S rRNA promoter and in front of the hemL terminator present in pGBPOlHEL- 
1 as described in the following steps. The two genes are located next to each other on 
the P. freudenreichii genome and are part from the same operon. A DNA fragment 
containing both genes was amplified by PCR from chromosomal DNA isolated from 
P. freudenreichii ATCC6207 using the following primers: 5'- 
CTgATATCAATTggAggACATCAgATgACCCgCATCgTC-3' and 5'- 
CTgAATTCCggCggCTCAggCgAAC AATg-3 ' . The first primer contains an EcoRV 
and a Mfel restriction site and the second primer contains an EcoRI restriction site: 
The PCR fragment obtained was ligated into the vector pCR-Blunt II-TOPO 
(Invitrogen). After transformation of E. coli kanamycin resistant colonies were 
obtained. The DNA sequence of the cloned fragment was verified through sequence 
analysis. The resulting construct was named pGBPPOB-L 

pGBPPOB-1 was subsequently digested with EcoRV and EcoRI and the 
fragment containing the genes coding for B and C was isolated and ligated into the 
large vector fragment isolated from pGBP01HEL-l which was first digested with 
Clal, than blunted with Klenow enzyme, and finally digested with Mfel. After 
transformation of E. coli ampicillin resistant colonies were obtained. Plasmid DNA 
isolated from transformants was verified by restriction analysis and the resulting 
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construct was called pGBP02POB-l. 

pGBP02POB-l was transformed to E. coli JM101 and plasmid DNA isolated 
from the obtained transformants was verified by restriction analysis. The plasmid 
DNA was subsequently transformed to P. freudenreichii ATCC6207 as described 
5 . before. Transformants were selected on erythromycin containing agar plates as 
described before. The plasmid DNA isolated from the P. freudenreichii 
transformants was verified by restriction analysis. 

Shake flask fermentations of pGBP02POB-l transformants using Ml culture 
medium and the determination of vitamin B12, adenosyl cobinamide and adenosyl 
10 cobinamide phosphate levels were performed as described before. 

Ratios of the amount of vitamin B12, adenosyl cobinamide and adenosyl 
cobinamide phosphate produced per ml culture OD-end- were calculated as described 
before. The ratio obtained for vitB12 was 19% higher than that obtained for 
pBRESP36B2pl6SH transformants. The ratio obtained for adenosyl cobinamide was 
15 96% lower than that obtained for pBRESP36B2pl6SH transformants and no 
adenosyl cobinamide phosphate could be detected in the pGBP02POB-l 
transformants. 

EXAMPLE 9 

20 

Construction of ah expression vector for the Propionibacterium freudenreichii genes 
encoding protein B. C and cob A (hence cob A and U and S) and its use for the 
production of vitamin B12 and its precursors in Propionibacterium freudenreichii 



25 In order to over-express simultaneously the genes encoding enzymes B (SEQ 

ID No.3), C (SEQ ID No.5) and the cobA gene in Propionibacterium the three genes 
were placed behind the 16S rRNA promoter and in front of the hemL terminator as 
described in the following steps. pGBP02POB-l was digested with Mfel and BglH 
and the 1 .6 kb fragment was isolated. This fragment was ligated to the 9.7 kb vector 

30 fragment of pGBP02COB-l digested with the same enzymes. After transformation of 
E. coli kanamycin resistant colonies were obtained. The DNA sequence of the cloned 
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fragment was verified through sequence analysis. The resulting construct was named 
pGBPPOE-1. 

pGBP02POE-l was transformed to E. coli JM101 and plasmid DNA isolated 
from the obtained transformants was verified by restriction analysis. The plasmid 
5 DNA was subsequently transformed to P. freudenreichii ATCC6207 as described 
before. Transformants were selected on erythromycin containing agar plates as 
described before. The plasmid DNA isolated from the P. freudenreichii 
transformants was verified by restriction analysis. 

Shake flask fermentations of pGBP02POE-l transformants using Ml culture 
10 medium and the determination of vitamin B 12, adenosyl cobinamide and adenosyl 
cobinamide phosphate levels were performed as described before. 

. Ratios of the amount of vitB12, adenosyl cobinamide and adenosyl * 
cobinamide phosphate produced per ml culture OD-end were calculated as described 
before. The ratio obtained for vitB12 was 45% higher than that obtained for 
15 pBRESP36B2p 1 6SH transformants. The ratio obtained for adenosyl cobinamide was 
65% lower than that obtained for pBRESP36B2pl6SH transformants and no 
adenosyl cobinamide phosphate could be detected in the pGBP02POE-l 
transformants. 

20- SUMMARY OF EXAMPLES 

The table below displays the experimental data (the empty plasmid values are set to 
1 00% for reference) for the transformed plasmids relative to the empty plasmid 
control strain) 

25 



Example 


Gene 


Cobinamide 


Combinamide P 


Vitamin B12 






(amount/ml/OD) 


(amount/ml/OD) 


(amount/ml/OD) 


5 


Empty 
plasmid 


100 


100 


100 
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O 


cobA 


OOH T70/N 

1 yyvi. i ' /o) 




j/'t ^-o /o^ 


7 


cobU 


4(-96%) 


800 (+700%) 


94 (-6%) 


8 


cobU and S 


4 (-96%) 


0 


119 (+19%) 


9 


CobA and U 
and S 


35 (-65%) 


0 


145 (+45%) 
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